Limitations in AI research and multimodal pipelines have limited possibilities with multimodal language translation and accessibility. Shellie democratizes access to knowledge with an open-source universal react component for universal speech translation. Features: Automatic speech recognition for nearly 100 languages Speech-to-text translation for nearly 100 input and output languages Speech-to-speech translation, supporting nearly 100 input languages and 35 (+ English) output languages Text-to-text translation for nearly 100 languages Text-to-speech translation, supporting nearly 100 input languages and 35 (+ English) output languages Implementation: Built as a React component widget Uses JavaScript to call Gradio API endpoints from Huggingface Huggingface Runs Inference on Multi-Modal SeamlessM4T Model UI returns text or audio file in one click
Category tags:Utility and Tools, Productivity, Language and Translation