Troubleshooting of Google cloud speech to text: Difference between revisions
Revision as of 10:52, 10 July 2020
→For audio longer than 1 min use LongRunningRecognize with a 'uri' parameter
Line 47: | Line 47: | ||
Solution: Specify the encoding of the audio file. For details, see [https://cloud.google.com/speech-to-text/docs/encoding Introduction to Audio Encoding | Cloud Speech-to-Text API | Google Cloud] and [https://cloud.google.com/speech-to-text/docs/reference/rest/v1/RecognitionConfig#AudioEncoding RecognitionConfig | Cloud Speech-to-Text API | Google Cloud]. You may use VLC player to view the encoding of the audio file<ref>[https://forum.videolan.org/viewtopic.php?t=95136#p315198 How to view audio bitrate in VLC - The VideoLAN Forums]</ref>. If the codec (encoding) of the audio file is not in the allowed list on [https://cloud.google.com/speech-to-text/docs/reference/rest/v1/RecognitionConfig#AudioEncoding page], convert the file with an [[Audio converter | audio converter]]. | Solution: Specify the encoding of the audio file. For details, see [https://cloud.google.com/speech-to-text/docs/encoding Introduction to Audio Encoding | Cloud Speech-to-Text API | Google Cloud] and [https://cloud.google.com/speech-to-text/docs/reference/rest/v1/RecognitionConfig#AudioEncoding RecognitionConfig | Cloud Speech-to-Text API | Google Cloud]. You may use VLC player to view the encoding of the audio file<ref>[https://forum.videolan.org/viewtopic.php?t=95136#p315198 How to view audio bitrate in VLC - The VideoLAN Forums]</ref>. If the codec (encoding) of the audio file is not in the allowed list on [https://cloud.google.com/speech-to-text/docs/reference/rest/v1/RecognitionConfig#AudioEncoding page], convert the file with an [[Audio converter | audio converter]]. | ||
== | == If the audio file's duration is longer than 1 minute use LongRunningRecognize with a 'uri' parameter == | ||
input | input | ||
<pre> | <pre> | ||
Line 84: | Line 84: | ||
</pre> | </pre> | ||
Solution: (1) | Solution: (1) If the audio file's duration is shorter than 1 min, use the URI {{kbd | key=<nowiki>speech:recognize</nowiki>}}. (2) If the audio file's duration is longer than 1 min, upload the file to [https://console.cloud.google.com/storage/ Google Cloud Storage] (GCS) and change the URI from {{kbd | key=<nowiki>speech:recognize</nowiki>}} to {{kbd | key=<nowiki>speech:longrunningrecognize</nowiki>}}. | ||
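The choice between the two methods can be sketched as a small shell helper. This is only an illustration, not part of the revision: the function names <code>speech_endpoint</code> and <code>speech_request_body</code>, the bucket path, the <code>languageCode</code> value, and the <code>API_KEY</code> variable are all hypothetical. Clips of exactly 60 seconds are sent to the long-running method as a conservative assumption.

```shell
#!/bin/sh
# Hypothetical helper: choose the REST method from the clip length.
# Takes the duration in whole seconds (integer comparison only).
speech_endpoint() {
  duration_s=$1
  base="https://speech.googleapis.com/v1"
  if [ "$duration_s" -lt 60 ]; then
    # Short audio: synchronous recognition.
    echo "$base/speech:recognize"
  else
    # About 1 minute or longer: asynchronous recognition.
    echo "$base/speech:longrunningrecognize"
  fi
}

# Hypothetical helper: build a request body that points at a file
# already uploaded to Google Cloud Storage (a gs:// URI).
# The language code is an assumed example value.
speech_request_body() {
  gcs_uri=$1
  printf '{"config": {"languageCode": "en-US"}, "audio": {"uri": "%s"}}\n' "$gcs_uri"
}

# Example use (commented out; needs a real key and a real bucket):
# curl -s -H "Content-Type: application/json" \
#   "$(speech_endpoint 95)?key=$API_KEY" \
#   -d "$(speech_request_body gs://my-bucket/audio.flac)"
```

Calling <code>speech_endpoint 30</code> prints the <code>speech:recognize</code> URL, while <code>speech_endpoint 95</code> prints the <code>speech:longrunningrecognize</code> URL.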
<pre> | <pre> | ||
$ curl -s -H "Content-Type: application/json" \ | $ curl -s -H "Content-Type: application/json" \ |