معرفی شرکت ها

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

Transcription APIs encapsulation

ویژگی	مقدار
سیستم عامل	-
نام فایل	ftvstt-2.0.0
نام	ftvstt
نسخه کتابخانه	2.0.0
نگهدارنده	[]
ایمیل نگهدارنده	[]
نویسنده	France Télévisions innovations et développement
ایمیل نویسنده	pierre-andre.long@francetv.fr
آدرس صفحه اصلی	-
آدرس اینترنتی	https://pypi.org/project/ftvstt/
مجوز	-

# Description ftvstt is a France Télévisions python library which encapsulates multiple online speech-to-text APIs, in order to call them as easily as possible. It currently supports : - [x] Amazon Transcribe - [x] Google Cloud speech-to-text - [x] Vocapia Voxsigma - [x] Bertin Mediaspeech # Quickstart ftvstt is not currently available through pip, you have to download the package and import it directly : ftvstt is currently available through pip by running : **pip install ftvstt**. You should then be able to import the package : ```python import ftvstt ``` # Usage Example of transcription through the services: Vocapia Voxsigma: ```python vocapiaTranscriber = ftvstt.Vocapia("https://rest1.vocapia.com:8093/voxsigma") vocapiaTranscriber.authenticate("EXAMPLE_ID","EXAMPLE_PASS") transcript = vocapiaTranscriber.transcribe("/path/to/file.wav") vocapiaTranscriber.deauthenticate() ``` Bertin Mediaspeech: ```python bertinTranscriber = ftvstt.Bertin("https://demo02.mediaspeech.com:4433/api") bertinTranscriber.authenticate("EXAMPLE_ID","EXAMPLE_PASS") transcript = bertinTranscriber.transcribe("/path/to/file.wav") bertinTranscriber.deauthenticate() ``` Amazon transcribe: ```python amazonTranscriber = ftvstt.Amazon("AMAZON_S3_BUCKET_NAME") amazonTranscriber.authenticate("AMAZON_AWS_ID_KEY", "AMAZON_AWS_SECRET_KEY") transcript = amazonTranscriber.transcribe("/path/to/file.wav") amazonTranscriber.deauthenticate() ``` You can also authenticate directly with path to the credentials csv file: ```python amazonTranscriber.authenticate_with_file("/path/to/amazon/credentials.csv") ``` You need an amazon AWS S3 bucket besides Amazon AWS Transcribe in order to make transcriptions. If your file is already on a S3 bucket, use instead: ```python transcript = amazonTranscriber.transcribe("https://url/to/s3/file.wav", s3file=True) ``` Google cloud speech-to-text: ```python googleTranscriber = ftvstt.Google() googleTranscriber.authenticate("GOOGLE_CLIENT_EMAIL", "GOOGLE_PIVATE_KEY") transcript = googleTranscriber.transcribe("/path/to/file.wav") googleTranscriber.deauthenticate() ``` You can also authenticate directly with path to client service credentials json file with google. ```python googleTranscriber.authenticate_with_file("/path/to/google/credentials.json") ``` # Custom vocabulary file For every provider except Bertin, you can add a custom vocabulary file of probable words as shown : ```python googleTranscriber = ftvstt.Google() googleTranscriber.authenticate_with_file("/path/to/google/credentials.json") googleTranscriber.set_vocabulary_file("/path/to/vocabulary/file.txt") transcript = googleTranscriber.transcribe("/path/to/file.wav") googleTranscriber.deauthenticate() ``` The vocabulary file should be of the form: ``` word1 word2 word3 ... ``` # Results handling Once a transcription is done, the transcribe function of a Transcriber returns a Transcript instance from ftvstt.transcripts sub-module. A Transcript instance, as transcript in previous codes, has several useful attributes : **transcript.text**: a string containing the textual transcript of the audio file. **transcript.words**: a list of Word instances from ftvstt.transcripts sub-module, each one has a content (str), a startTime (float), an endTime (float), a speaker (Speaker instance from ftvstt.transcripts sub-module) (and can have a confidence (float) depending on the provider used) attribute. **transcript.speakers**: a list of Speaker instances from ftvstt.transcripts sub-module, each one has an id (int), (and can have a gender (str : "M" or "F") depending on the provider used). **transcript.raw**: a string containing the raw result of the transcription received from the provider, which type is transcript.rawType (str : "json" or "xml"). You can also dump the results in a normalised (the same format for any provider) json file: ```python transcript.dump_normalised_result("/output/path/to/normalized/result.json") ``` You can also load a transcript from a raw result file: ```python with open("/path/to/bertin/raw/result.xml") as file: raw = file.read() transcript = ftvstt.Transcript(raw=raw, provider=ftvstt.Bertin) ``` # Error handling If an error has occured during transcription, a custom python Exception from the ftvstt.exceptions sub-module will be raised. The error will also be accessbile in the exception attribute of the transcript result, as you can see in this example: ```python googleTranscriber = ftvstt.Google() googleTranscriber.authenticate("/path/to/google/credentials.json") try: transcript = googleTranscriber.transcribe("/path/to/file.wav") except: pass raise transcript.exception googleTranscriber.deauthenticate() ``` # Testing Coming soon...

نیازمندی

مقدار	نام
==1.10.4	boto3
==1.2.0	google-cloud-speech
==2.22.0	requests

نحوه نصب

نصب پکیج whl ftvstt-2.0.0:

pip install ftvstt-2.0.0.whl

نصب پکیج tar.gz ftvstt-2.0.0:

pip install ftvstt-2.0.0.tar.gz