🤗 HuggingFaceSignificantChenyang Lyu
LongSpeech: Benchmark Dataset for Long-Form Speech Understanding
LongSpeech is a new benchmark dataset for evaluating audio LLMs on long-form speech understanding. Contains 100,000+ segments averaging ~10 minutes each, spanning 8 evaluation tasks: ASR, translation, summarization, spea…