Google Speech-to-Text lets developers convert audio to text by applying neural network models through an easy-to-use API. The API recognizes more than 120 languages and variants to support a global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It processes both real-time streaming and prerecorded audio using Google's machine learning technology.

Required Ruby Version

>= 2.7
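
As a minimal sketch of how the `>= 2.7` requirement above can be checked before installing, the snippet below uses RubyGems' own `Gem::Requirement` and `Gem::Version` classes. The constant `RUBY_REQUIREMENT` and the helper `ruby_supported?` are hypothetical names introduced here for illustration; they are not part of this gem.

```ruby
require "rubygems"

# Requirement string copied from this page.
RUBY_REQUIREMENT = Gem::Requirement.new(">= 2.7")

# Hypothetical helper: returns true when the given Ruby version string
# satisfies the requirement. Defaults to the running interpreter.
def ruby_supported?(version = RUBY_VERSION)
  RUBY_REQUIREMENT.satisfied_by?(Gem::Version.new(version))
end

puts ruby_supported?("2.6.10") # false
puts ruby_supported?("2.7.0")  # true
puts ruby_supported?("3.2.2")  # true
```

Calling `ruby_supported?` with no argument checks the interpreter that is currently running, which may be useful in a setup script that should stop early on an unsupported Ruby.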

Authors

Google LLC

Versions

  1. 2.2.0 June 11, 2026 (14.5 KB)
  2. 2.1.0 March 20, 2026 (14.5 KB)
  3. 2.0.4 September 12, 2025 (14.5 KB)
  4. 2.0.3 August 29, 2025 (14 KB)
  5. 2.0.2 May 27, 2025 (14 KB)
  6. 1.7.0 February 26, 2024 (16.5 KB)
56 versions in total; the list above is partial.
