How ML Models Search Audio with Text Queries
 · 5 min read
In this post, I will be explaining everything you need to know about the Language-Based Audio Retrieval architecture. This architecture is designed to retrieve audio clips based on natural language queries. It uses a combination of audio and text encoders to achieve this.