What audio formats are supported in Conversation to Text?
Overview
This article lists the audio and video file formats supported by the Conversation to Text module.
Applies to
All Users
Supported audio formats
The Conversation to Text module supports commonly used audio file formats, including MP3, WAV, OGG, M4A, and AAC.
Of the formats listed, only WAV is typically uncompressed and lossless. MP3, OGG, M4A, and AAC are typically lossy, meaning they discard some audio data to reduce file size. WAV preserves the recording exactly but produces large files, while the others are optimized for streaming and storage.
Format Breakdown:
WAV (Lossless/Uncompressed): Stores all original audio data, commonly used for high-fidelity audio editing.
MP3 (Lossy): Widely compatible, but its compression discards some audio detail, which is most noticeable at low bitrates.
AAC (Lossy): Often superior to MP3 at similar bitrates, used by YouTube and Apple.
M4A (Lossy): A container that typically holds AAC-encoded audio, serving as a modern alternative to MP3.
OGG (Lossy): An open container format that typically carries Vorbis audio; comparable to MP3 and used by platforms like Spotify.
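To give a rough sense of the size difference between uncompressed WAV and a lossy format, the arithmetic can be sketched as below. The sample rate, bit depth, channel count, and 128 kbps bitrate are illustrative assumptions (common defaults), not settings required by the Conversation to Text module.

```python
def pcm_wav_bytes(seconds, sample_rate=44_100, bit_depth=16, channels=2):
    """Approximate size of uncompressed PCM audio, ignoring the small WAV header."""
    return seconds * sample_rate * (bit_depth // 8) * channels


def constant_bitrate_bytes(seconds, kbps=128):
    """Approximate size of a constant-bitrate lossy file, e.g. MP3 at 128 kbps."""
    return seconds * kbps * 1000 // 8


# Assumed example: a one-hour recording.
one_hour = 60 * 60
wav_mb = pcm_wav_bytes(one_hour) / 1_000_000
mp3_mb = constant_bitrate_bytes(one_hour) / 1_000_000

print(f"1 hour WAV: about {wav_mb:.0f} MB")
print(f"1 hour MP3: about {mp3_mb:.0f} MB")
```

Under these assumptions, a one-hour stereo WAV comes to about 635 MB, while a 128 kbps MP3 of the same length is about 58 MB.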
Supported video formats
Video files that contain an audio track, such as MP4, are also accepted. The audio track is extracted from the video for transcription. As with audio files, higher-quality recordings produce better transcription results.
Zoom and Teams recordings
Recordings exported from Zoom and Teams are typically in MP4 format, which the Conversation to Text module supports.
Unsupported formats
If you have a recording in a format not listed above, you can convert it to a supported format using free tools such as Audacity (for audio) or HandBrake (for video) before uploading. Contact Dictalogic support if you are unsure whether your file format is supported.
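If you have many files to check before uploading, a quick local check of file extensions can be sketched as follows. The extension list is an assumption built only from the formats named in this article, so the module may accept more; the function name is hypothetical, and an extension does not guarantee what the file actually contains.

```python
from pathlib import Path

# Assumption: extensions for the formats named in this article only.
# The list actually accepted by the module may be longer.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".ogg", ".m4a", ".aac", ".mp4"}


def looks_supported(filename: str) -> bool:
    """Return True if the file extension matches a format named in this article."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


# Hypothetical file names, for illustration only.
for name in ["interview.MP3", "team-call.mp4", "voicemail.wma"]:
    if looks_supported(name):
        status = "looks supported"
    else:
        status = "convert it or check with support"
    print(f"{name}: {status}")
```

The check is case-insensitive, so a file named interview.MP3 is treated the same as interview.mp3.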