معرفی شرکت ها


amazon-textract-helper-0.0.4


Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر
Card image cap
تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

Amazon Textract Helper tools
ویژگی مقدار
سیستم عامل -
نام فایل amazon-textract-helper-0.0.4
نام amazon-textract-helper
نسخه کتابخانه 0.0.4
نگهدارنده []
ایمیل نگهدارنده []
نویسنده Amazon Rekognition Textract Demoes
ایمیل نویسنده rekognition-textract-demos@amazon.com
آدرس صفحه اصلی https://github.com/aws-samples/amazon-textract-textractor/tree/master/helper
آدرس اینترنتی https://pypi.org/project/amazon-textract-helper/
مجوز Apache License Version 2.0
# Textractor-Textract-Helper amazon-textract-helper provides a collection of ready to use functions and sample implementations to speed up the evaluation and development for any project using Amazon Textract. It installs a command line tool called ```amazon-textract``` # Install ```bash > python -m pip install amazon-textract-helper ``` Make sure your environment is setup with AWS credentials through configuration files or environment variables or an attached role. (https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html) # Test ```bash > amazon-textract --help usage: amazon-textract [-h] (--input-document INPUT_DOCUMENT | --example | --stdin) [--features {FORMS,TABLES} [{FORMS,TABLES} ...]] [--pretty-print {WORDS,LINES,FORMS,TABLES} [{WORDS,LINES,FORMS,TABLES} ...]] [--pretty-print-table-format {csv,plain,simple,github,grid,fancy_grid,pipe,orgtbl,jira,presto,pretty,psql,rst,medi awiki,moinmoin,youtrack,html,unsafehtml,latex,latex_raw,latex_booktabs,latex_longtable,textile,tsv}] [--overlay {WORD,LINE,FORM,KEY,VALUE,TABLE,CELL} [{WORD,LINE,FORM,KEY,VALUE,TABLE,CELL} ...]] [--pop-up-overlay-output] [--overlay-output-folder OVERLAY_OUTPUT_FOLDER] [--version] [--no-stdout] [-v | -vv] optional arguments: -h, --help show this help message and exit --input-document INPUT_DOCUMENT s3 object (s3://) or file from local filesystem --example using the example document to call Textract --stdin receive JSON from stdin --features {FORMS,TABLES} [{FORMS,TABLES} ...] features to call Textract with. Will trigger call to AnalyzeDocument instead of DetectDocumentText --pretty-print {WORDS,LINES,FORMS,TABLES} [{WORDS,LINES,FORMS,TABLES} ...] --pretty-print-table-format {csv,plain,simple,github,grid,fancy_grid,pipe,orgtbl,jira,presto,pretty,psql,rst,mediawiki,moinmoin,youtrac k,html,unsafehtml,latex,latex_raw,latex_booktabs,latex_longtable,textile,tsv} which format to output the pretty print information to. Only effects FORMS and TABLES --overlay {WORD,LINE,FORM,KEY,VALUE,TABLE,CELL} [{WORD,LINE,FORM,KEY,VALUE,TABLE,CELL} ...] defines what bounding boxes to draw on the output --pop-up-overlay-output shows image with overlay --overlay-text shows image with WORD or LINE text overlay. When both WORD and LINE overlay are specified, WORD text will be overlayed --overlay-confidence shows image with confidence overlay --overlay-output-folder OVERLAY_OUTPUT_FOLDER output with bounding boxes to folder --version print version information --no-stdout no output to stdout -v >=INFO level logging output to stderr -vv >=DEBUG level logging output to stderr ``` # Sample Commands ## Easy Start ```bash > amazon-textract --example ``` this will run the examples document using the DetectDocumentText API. Output will be printed to stdout and look similar to this: ```json {"DocumentMetadata": {"Pages": 1}, "Blocks": [{"BlockType": "PAGE", "Geometry": {"BoundingBox": {"Width": 1.0, "Height": 1.0, "Left": 0.0 , "Top": 0.0}, "Polygon": [{"X": 9.33321120033382e-17, "Y": 0.0}, {"X": 1.0, "Y": 1.6069064689339292e-16}, {"X": 1.0, "Y": 1.0}], "HTTPHeaders": {"x-amzn-requestid": "12345678-1234-1234-1234-123456789012", "content-type": "a pplication/x-amz-json-1.1", "content-length": "48177", "date": "Thu, 01 Apr 2021 21:50:29 GMT"}, "RetryAttempts": 0}} ``` It is working. ## Call with document on S3 ```bash > amazon-textract --input-document "s3://somebucket/someprefix/someobjectname.png" ``` Output similar to Easy Start ## Call with document on local file system ```bash > amazon-textract --input-document "./somepath/somefilename.png" ``` Output similar to Easy Start We will continue to use the ```--example``` parameter to keep it simple and easy to reproduce. S3 and local files work the same way, just instead of --example use --input-document <location>. ## Call with STDIN ```bash # first create JSON amazon-textract --example > example.json # now use a stored JSON with the ```amazon-textract``` command cat example.json | amazon-textract --stdin -pretty-print LINES ``` ## Call with FORMS and TABLES ```bash > amazon-textract --example --features FORMS TABLES ``` This will call the [AnalyzeDocument API] (https://docs.aws.amazon.com/textract/latest/dg/API_AnalyzeDocument.html) and output will include Output will look similar to "Easy Start" but include FORMS and TABLES information ## Pretty print the output Pretty print outputs nicely formatted information for words, lines, forms or tables. For example to print the tables identified by Amazon Textract to stdout, use ```bash > amazon-textract --example --features TABLES --pretty-print TABLES ``` Output will look like this: ```text |------------|-----------|---------------------|-----------------|-----------------------| | | | Previous Employment | History | | | Start Date | End Date | Employer Name | Position Held | Reason for leaving | | 1/15/2009 | 6/30/2011 | Any Company | Assistant Baker | Family relocated | | 7/1/2011 | 8/10/2013 | Best Corp. | Baker | Better opportunity | | 8/15/2013 | present | Example Corp. | Head Baker | N/A, current employer | ``` to pretty print both, FORMS and TABLES: ```bash > amazon-textract --example --features FORMS TABLES --pretty-print FORMS TABLES ``` will output ```text Phone Number:: 555-0100 Home Address:: 123 Any Street, Any Town, USA Full Name:: Jane Doe Mailing Address:: same as home address |------------|-----------|---------------------|-----------------|-----------------------| | | | Previous Employment | History | | | Start Date | End Date | Employer Name | Position Held | Reason for leaving | | 1/15/2009 | 6/30/2011 | Any Company | Assistant Baker | Family relocated | | 7/1/2011 | 8/10/2013 | Best Corp. | Baker | Better opportunity | | 8/15/2013 | present | Example Corp. | Head Baker | N/A, current employer | ``` ## Overlay **At the moment overlay only works with images, we will add support for PDF soon.** The following command runs DetectDocumentText, pretty prints the WORDS in the document to stdout and draws bounding boxes around each WORD and displays the result in a popup window and stores it to a folder called 'overlay-output-folder-name'. ```bash amazon-textract --example --pretty-print WORDS --overlay WORD --pop-up-overlay-output --overlay-output-folder overlay-output-folder-name ``` <img src="https://github.com/aws-samples/amazon-textract-textractor/blob/master/helper/docs/employmentapp_boxed_WORD_.png" alt="Sample overlay WORD" width="50%" height="50%" border="1"> The following command runs AnalyzeDocument for FORMS and TABLES, pretty prints FORMS and TABLES to to stdout and draws bounding boxes around each TABLE-CELL and FORM KEY/VALUE and displays the result in a popup window and stores it to a folder called 'overlay-output-folder-name'. ```bash > amazon-textract --example --features TABLES FORMS --pretty-print FORMS TABLES --overlay FORM CELL --pop-up-overlay-output --overlay-output-folder ../mywonderfuloutputfolderfordocs/ ``` <img src="https://github.com/aws-samples/amazon-textract-textractor/blob/master/helper/docs/employmentapp_boxed_FORM_CELL_.png" alt="Sample overlay FORM CELL" width="50%" height="50%" border="1"> The following command draws bounding boxes around each WORD, overlays the detected WORD text, and displays the result in a popup window and stores it to a folder called 'overlay-output-folder-name'. ```bash > amazon-textract --example --overlay WORD --overlay-text --pop-up-overlay-output --overlay-output-folder overlay-output-folder-name ``` <img src="https://github.com/aws-samples/amazon-textract-textractor/blob/master/helper/docs/employmentapp_boxed_WORD_TEXT_OVERLAY.png" alt="Sample overlay LINE with overlay text and confidence percentage" width="50%" height="50%" border="1"> The following command draws bounding boxes around each LINE, overlays LINE text along with percentage confidence of the detected LINE text, and displays the result in a popup window and stores it to a folder called 'overlay-output-folder-name'. ```bash > amazon-textract --example --overlay LINE --overlay-text --overlay-confidence --pop-up-overlay-output --overlay-output-folder overlay-output-folder-name ``` <img src="https://github.com/aws-samples/amazon-textract-textractor/blob/master/helper/docs/employmentapp_boxed_LINE_TEXT_OVERLAY.png" alt="Sample overlay LINE with overlay text and confidence percentage" width="50%" height="50%" border="1">


نیازمندی

مقدار نام
- boto3
- botocore
>=0.1.40 amazon-textract-response-parser
>=0.0.27 amazon-textract-caller
>=0.0.10 amazon-textract-overlayer
>=0.1.0 amazon-textract-prettyprinter
>=9.2.* Pillow
>=2.5.* PyPDF2


زبان مورد نیاز

مقدار نام
>=3.6 Python


نحوه نصب


نصب پکیج whl amazon-textract-helper-0.0.4:

    pip install amazon-textract-helper-0.0.4.whl


نصب پکیج tar.gz amazon-textract-helper-0.0.4:

    pip install amazon-textract-helper-0.0.4.tar.gz