معرفی شرکت ها


dataherb-0.1.5


Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

Get clean datasets from DataHerb to boost your data science and data analysis projects
ویژگی مقدار
سیستم عامل -
نام فایل dataherb-0.1.5
نام dataherb
نسخه کتابخانه 0.1.5
نگهدارنده []
ایمیل نگهدارنده []
نویسنده Lei Ma
ایمیل نویسنده hi@leima.is
آدرس صفحه اصلی https://github.com/DataHerb/dataherb-python
آدرس اینترنتی https://pypi.org/project/dataherb/
مجوز MIT
<h1 align="center"> <br> <a href="https://dataherb.github.io"><img src="https://raw.githubusercontent.com/DataHerb/dataherb.github.io/master/assets/favicon/ms-icon-310x310.png" alt="Markdownify" width="200"></a> <br> The Python Package for DataHerb <br> </h1> <h4 align="center">A <a href="https://dataherb.github.io" target="_blank">DataHerb</a> Core Service to Create and Load Datasets.</h4> <p align="center"> </p> ## Install ``` pip install dataherb ``` Documentation: [dataherb.github.io/dataherb-python](https://dataherb.github.io/dataherb-python) ## The DataHerb Command-Line Tool > Requires Python 3 The DataHerb cli provides tools to create dataset metadata, validate metadata, search dataset in flora, and download dataset. ### Search and Download Search by keyword ``` dataherb search covid19 # Shows the minimal metadata ``` Search by dataherb id ``` dataherb search -i covid19_eu_data # Shows the full metadata ``` Download dataset by dataherb id ``` dataherb download covid19_eu_data # Downloads this dataset: http://dataherb.io/flora/covid19_eu_data ``` ### Create Dataset Using Command Line Tool We provide a template for dataset creation. Within a dataset folder where the data files are located, use the following command line tool to create the metadata template. ```bash dataherb create ``` ### Upload dataset to remote Within the dataset folder, run ```bash dataherb upload ``` ### UI for all the datasets in a flora ```bash dataherb serve ``` ## Use DataHerb in Your Code ### Load Data into DataFrame ``` # Load the package from dataherb.flora import Flora # Initialize Flora service # The Flora service holds all the dataset metadata use_flora = "path/to/my/flora.json" dataherb = Flora(flora=use_flora) # Search datasets with keyword(s) geo_datasets = dataherb.search("geo") print(geo_datasets) # Get a specific file from a dataset and load as DataFrame tz_df = pd.read_csv( dataherb.herb( "geonames_timezone" ).get_resource( "dataset/geonames_timezone.csv" ) ) print(tz_df) ``` ## The DataHerb Project ### What is DataHerb DataHerb is an open-source data discovery and management tool. - A **DataHerb** or **Herb** is a dataset. A dataset comes with the data files, and the metadata of the data files. - A **Herb Resource** or **Resource** is a data file in the DataHerb. - A **Flora** is the combination of all the DataHerbs. In many data projects, finding the right datasets to enhance your data is one of the most time consuming part. DataHerb adds flavor to your data project. By creating metadata and manage the datasets systematically, locating an dataset is much easier. Currently, dataherb supports sync dataset between local and S3/git. Each dataset can have its own remote location. ### What is DataHerb Flora We desigined the following workflow to share and index open datasets. ![DataHerb Workflow](https://raw.githubusercontent.com/DataHerb/dataherb.github.io/master/assets/images/dataherb-components.png) > The repo [dataherb-flora](https://github.com/DataHerb/dataherb-flora) is a demo flora that lists some datasets and demonstrated on the website [https://dataherb.github.io](https://dataherb.github.io). At this moment, the whole system is being renovated. ## Development 1. Create a conda environment. 2. Install requirements: `pip install -r requirements.txt` ## Documentation The source of the documentation for this package is located at `docs`. ## References and Acknolwedgement - `dataherb` uses `datapackage` in the core. `datapackage` is a python library for the [data-package standard](https://specs.frictionlessdata.io/data-package/). The core schema of the dataset is essentially the data-package standard.


نیازمندی

مقدار نام
>=0.23 pandas
>=2.22.0 requests
>=0.2.2 rapidfuzz
>=0.16.10 ruamel.yaml
>=7.0 click
>=2.6.3 inquirer
>=0.4.3 colorama
>=3.1.0 GitPython
>=0.5.3 loguru
>=1.15.2 datapackage
==1.20.13 awscli
<0.16,>=0.10 docutils
==7.1.8 mkdocs-material
==5.0.2 python-slugify
==0.5.12 mkdocs-macros-plugin
>=10.7.0 rich
>=2.4.1 sphinx
>=0.4.3 sphinx-rtd-theme


نحوه نصب


نصب پکیج whl dataherb-0.1.5:

    pip install dataherb-0.1.5.whl


نصب پکیج tar.gz dataherb-0.1.5:

    pip install dataherb-0.1.5.tar.gz