What audio formats are supported in Conversation to Text?

Overview

This article lists the audio and video file formats supported by the Conversation to Text module.

Applies to

All Users

Supported audio formats

The Conversation to Text module supports commonly used audio file formats, including MP3, WAV, OGG, M4A, and AAC.

Of the formats listed, only WAV is normally uncompressed and therefore lossless. MP3, OGG, M4A, and AAC files are typically lossy, meaning they discard some audio data to reduce file size. WAV preserves the original audio exactly at the cost of large files, while the others are optimized for streaming and storage.

Format Breakdown:

WAV (Lossless/Uncompressed): Stores all original audio data, commonly used for high-fidelity audio editing.

MP3 (Lossy): Widely compatible, but its compression discards some audio detail, most noticeably at low bitrates.

AAC (Lossy): Often superior to MP3 at similar bitrates, used by YouTube and Apple.

M4A (Lossy): A container format that typically holds AAC-encoded audio, functioning as a modern alternative to MP3.

OGG (Lossy): An open, royalty-free container that usually holds Vorbis audio. Comparable to MP3 and used by platforms like Spotify.

Supported video formats

Video files that contain an audio track, such as MP4, are also accepted. The audio track is extracted from the video for transcription. As with audio files, higher-quality recordings produce better transcription results.
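
If you want to check a batch of files before uploading, the formats named in this article can be turned into a quick extension check. The following is a minimal Python sketch, not part of the Conversation to Text module: the function name and the extension list are assumptions based only on the formats mentioned here, and an extension match does not guarantee that the file's contents are valid.

```python
from pathlib import Path

# Extensions assumed from the formats named in this article (audio plus MP4 video).
# The module's actual validation rules may differ.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".ogg", ".m4a", ".aac", ".mp4"}


def is_supported(filename: str) -> bool:
    """Return True if the file extension matches a format listed in this article."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


print(is_supported("team_meeting.MP4"))  # True (the check ignores case)
print(is_supported("interview.flac"))    # False
```

Files that fail this check are candidates for conversion before upload.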

Zoom and Teams recordings

Recordings exported from Zoom and Teams are typically in MP4 format. Both are supported by the Conversation to Text module.

Unsupported formats

If you have a recording in a format not listed above, you can convert it to a supported format using free tools such as Audacity (for audio) or HandBrake (for video) before uploading. Contact Dictalogic support if you are unsure whether your file format is supported.
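
For users comfortable with the command line, the same conversion can be scripted. The sketch below is an alternative to the tools named above and assumes the separate ffmpeg command-line tool is installed. It is not something this article or Dictalogic prescribes. The helper name is hypothetical, and WAV is chosen as the target only because it is one of the supported formats.

```python
from pathlib import Path
from typing import List


def build_convert_command(source: str) -> List[str]:
    """Build an ffmpeg command that converts `source` to a WAV file beside it.

    -i names the input file; -vn drops any video stream so only audio is written.
    ffmpeg selects the WAV encoder from the output file's .wav extension.
    """
    target = Path(source).with_suffix(".wav")
    return ["ffmpeg", "-i", source, "-vn", str(target)]


# Only builds the command; nothing is executed here.
print(build_convert_command("call.wma"))
```

To run the conversion, pass the returned list to subprocess.run(cmd, check=True) on a machine where ffmpeg is available.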
