معرفی شرکت ها


exfill-0.1.8


Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

A small app to grab job postings from online job boards
ویژگی مقدار
سیستم عامل -
نام فایل exfill-0.1.8
نام exfill
نسخه کتابخانه 0.1.8
نگهدارنده []
ایمیل نگهدارنده []
نویسنده jay-law
ایمیل نویسنده jay-law@users.noreply.github.com
آدرس صفحه اصلی https://github.com/jay-law/job-scraper
آدرس اینترنتی https://pypi.org/project/exfill/
مجوز -
# Introduction Job boards (like LinkedIn) can be a good source for finding job openings. Unfortunately the search results cannot always be filtered to a usable degree. Exfill (short for extraction) lets users scrape and parse jobs with more flexability provided by the default search. Currently only LinkedIn is supported. # Project Structure Directories: - `src/exfill/parsers` - Contains parser(s) - `src/exfill/scrapers` - Contains scraper(s) - `src/exfill/support` - Contains `geckodriver` driver for FireFox which is used by Selenium - Download the latest driver from the [Mozilla GeckoDriver repo in GitHub](https://github.com/mozilla/geckodriver) - `data/html` - Not in source control - Contains HTML elements for a specific job posting - Populated by a scraper - `data/csv` - Not in source control - Contains parsed information in a csv table - Populated by a parser - Also contains an error table - `logs` - Not in source control - Contains logs created during execution ## `creds.json` File Syntax should be as follows: ```json { "linkedin": { "username": "jay-law@protonmail.com", "password": "password1" } } ``` # Usage There are two `actions` required to generate usable data: First is the scraping action. When called, a browser will open and perform a job query on the specified `site`. Each posting will be exported to the `data/html` directory. The second action is parsing. Each job posting in `data/html` will be opened and analyzed. Once all postings have been analyzed a single CSV file will be exported to `data/csv`. The csv file provides a high-level overview of all the jobs returned during the query. When imported to a spreadsheet, users can filter on fields not present in the original search options. Examples include sorting by companies or excluding certain industries. ## Add Creds File This is required for all usage. ```bash # Install with git $ git clone git@github.com:jay-law/job-scraper.git # Create and populate creds.json. Bash only: cat <<EOF > creds.json { "linkedin": { "username": "jay-law@protonmail.com", "password": "password1" } } EOF ``` ## Use as Code ```bash # Install with git $ git clone git@github.com:jay-law/job-scraper.git # Activate virtual env $ poetry shell # Install dependencies $ poetry install # Ensure creds.json exists (see above) # Execute - Scrape linkedin $ python3 exfill/extractor.py linkedin scrape # Execute - Parse linkedin $ python3 exfill/extractor.py linkedin parse ``` ## Use as Module ```bash # Install $ poetry add exfill # Ensure creds.json exists (see above) # Execute - Scrape linkedin $ python3 -m exfill.extractor linkedin scrape # Execute - Parse linkedin $ python3 -m exfill.extractor linkedin parse ``` # Roadmap * [x] Write unit tests * [ ] Improve secret handling * [x] Add packaging * [x] Move paths to config file * [x] Move keyword logic * [x] Set/include default config.ini for users installing with PIP * [x] Add CICD * [x] Automate versioning * [x] Add formatter (black module) * [x] Add static type checking (mypy module) * [x] Add import sorter (isort module) * [x] Add linter (flake8 module) * [x] Update string interpolation from %f to f-string * [x] Replace sys.exit calls with exceptions * [x] Update how the config object is accessed * [x] Migrate to `poetry` for virtual env, building, and publishing * [x] Replace os.path usage with pathlib * [x] Replace pandas export with csv export * [x] Replace unittest with pytest


نیازمندی

مقدار نام
>=0.0.1,<0.0.2 bs4
>=4.1.5,<5.0.0 selenium
>=6.4.2,<7.0.0 setuptools-scm


زبان مورد نیاز

مقدار نام
>=3.8,<4.0 Python


نحوه نصب


نصب پکیج whl exfill-0.1.8:

    pip install exfill-0.1.8.whl


نصب پکیج tar.gz exfill-0.1.8:

    pip install exfill-0.1.8.tar.gz