معرفی شرکت ها


TopDownHockey-Scraper-2.1.8


Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

The TopDownHockey Scraper
ویژگی مقدار
سیستم عامل OS Independent
نام فایل TopDownHockey-Scraper-2.1.8
نام TopDownHockey-Scraper
نسخه کتابخانه 2.1.8
نگهدارنده []
ایمیل نگهدارنده []
نویسنده Patrick Bacon
ایمیل نویسنده patrick.s.bacon@gmail.com
آدرس صفحه اصلی https://github.com/TopDownHockey/TopDownHockey_Scraper
آدرس اینترنتی https://pypi.org/project/TopDownHockey-Scraper/
مجوز MIT
# TopDownHockey EliteProspects Scraper ## By Patrick Bacon, made possible by the work of Marcus Sjölin and Harry Shomer. --- This is a package built for scraping two data sources: 1. The NHL's Play-by-Play Reports, which come in the form of HTML/API reports from the NHL and XML reports from ESPN. 2. Elite Prospects, an extremely valuable website which makes hockey data for thousands of leagues available to the public. This package is strictly built for end users who wish to scrape data for personal use. If you are interested in using Elite Prospects data for professional purposes, I recommend you look into the <a href="https://www.eliteprospects.com/api" >Elite Prospects API</a>. While using the scraper, please be mindful of EliteProspects servers. # Installation --- You can install the package by entering the following command in terminal: <code>pip install TopDownHockey_Scraper</code> If you're interested in using the NHL Play-By-Play scraper, import that module using this function in Python: <code>import TopDownHockey_Scraper.TopDownHockey_NHL_Scraper as tdhnhlscrape</code> If you're interested in using the Elite Prospects scraper, import that module using this function in Python: <code>import TopDownHockey_Scraper.TopDownHockey_EliteProspects_Scraper as tdhepscrape</code> # User-End Functions (NHL Scraper) --- ### scrape_schedule(start_date, end_date) Returns the NHL's schedule from the API for all games played between a start date and an end date. <ul> <li>start_date: The first date in the list of game dates that you would like to scrape. Enter as a string in "YYYY-MM-DD" format.</li> <li>end_date: The last date in the list of game dates that you would like to scrape. Enter as a string in "YYYY-MM-DD" format.</li> </ul> Example: <code>tdhnhlscrape.scrape_schedule("2021-01-01", "2021-05-20")</code> --- ### full_scrape(game_id_list, shift = True) Returns a dataframe containing play-by-play data for a list of game ids. <ul> <li>game_id_list: A list of NHL game ids.</li> <li>shift: Shift the coordinate source to ESPN. By default, the program will attempt to scrape the NHL's API for location coordinates first.</li> </ul> Example: <code>tdhnhlscrape.full_scrape([2020020014, 2020020015, 2020020016])</code> Combine the two functions and scrape the entire 2021 regular season: - <code>schedule_2021 = tdhnhlscrape.scrape_schedule("2021-01-01", "2021-05-20")</code> - <code>schedule_2021 = schedule_2021[schedule_2021.type=='R']</code> - <code>game_list_2021 = list(schedule_2021.ID)</code> - <code>pbp_2021 = tdhnhlscrape.full_scrape(game_list_2021)</code> # User-End Functions (Elite Prospects Scraper) --- ### get_skaters(leagues, seasons) Returns a dataframe containing statistics for all skaters in a target set of league(s) and season(s). <ul> <li>leagues: One or multiple leagues. If one league, enter as a string i.e; "nhl". If multiple leagues, enter as a tuple or list i.e; ("nhl", "ahl").</li> <li>seasons: One or multiple leagues. If one league, enter as a string i.e; "2018-2019". If multiple leagues, enter as a tuple or list i.e; ("2018-2019", "2019-2020").</li> </ul> Example: <code>tdhepscrape.get_skaters(("nhl", "ahl"), ("2018-2019", "2019-2020"))</code> --- ### get_goalies(leagues, seasons) Returns a dataframe containing statistics for all goalies in a target set of league(s) and season(s). <ul> <li>leagues: One or multiple leagues. If one league, enter as a string i.e; "nhl". If multiple leagues, enter as a tuple or list i.e; ("nhl", "ahl").</li> <li>seasons: One or multiple leagues. If one league, enter as a string i.e; "2018-2019". If multiple leagues, enter as a tuple or list i.e; ("2018-2019", "2019-2020").</li> </ul> Example: <code>tdhepscrape.get_goalies("khl", "2015-2016")</code> --- ### get_player_information(dataframe) Returns a dataframe containing bio information for all skaters or goalies (or both) within a target dataframe. <ul> <li>dataframe: The dataframe returned by one of the previous two commands.</li> </ul> Example: Say you obtain skater data for the KHL in 2020-2021 and store that as a dataframe called <code>output</code>. You can run this function to get bio information for every player in that league's scrape. <code>output = tdhepscrape.get_skaters("khl", "2020-2021")</code> <code>tdhepscrape.get_player_information(output)</code> --- ### add_player_information(dataframe) Returns a dataframe containing bio information for all skaters or goalies (or both) within a target dataframe as well as the statistics from the original dataframe. <ul> <li>dataframe: The dataframe returned by one of the previous two commands.</li> </ul> Example: Say you obtain skater data for the KHL in 2020-2021 and store that as a dataframe called <code>output</code>. You can run this function to get bio information for every player in that league's scrape. <code>output = tdhepscrape.get_skaters("khl", "2020-2021")</code> <code>tdhepscrape.add_player_information(output)</code> # Comments, Questions, and Concerns. --- My goal was to make this package as error-proof as possible. I believe I've accounted for every issue that could potentially throw off a scrape, but it's possible I've missed something. If any issues arise, or you have any questions about the package, please do not hesitate to contact me on Twitter at @TopDownHockey or email me directly at patrick.s.bacon@gmail.com.


نیازمندی

مقدار نام
- numpy
- pandas
- bs4
- datetime
- seaborn
- matplotlib
- xmltodict
- lxml
- natsort


زبان مورد نیاز

مقدار نام
>=3.6 Python


نحوه نصب


نصب پکیج whl TopDownHockey-Scraper-2.1.8:

    pip install TopDownHockey-Scraper-2.1.8.whl


نصب پکیج tar.gz TopDownHockey-Scraper-2.1.8:

    pip install TopDownHockey-Scraper-2.1.8.tar.gz