Usually includes .wav or .flac audio files along with ground-truth transcriptions and timestamped speaker labels.

The file is a compressed archive typically associated with the Quartet project , a well-known research dataset and benchmarking suite for evaluating speaker diarization and speech recognition systems. It often contains specific audio recordings, such as the "Two-person Dialogue" or "Four-person Meeting" subsets used by developers and researchers to test how well AI can distinguish between different voices.

The Quartet02.7z file typically provides a standardized set of audio data that researchers use to benchmark their algorithms. By using the same data, developers can directly compare the "Diarization Error Rate" (DER) of different models.