You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There are several APIs available to convert text to speech in Python. One of such APIs is the Google Text to Speech API commonly known as the gTTS API. gTTS is a very easy to use tool which converts the text entered, into audio which can be saved as a mp3 file.
An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features include speech-to-text with Nemo, text generation with Mistral-7B, DuckDuckGo search integration, and text-to-speech with edge-tts, all in a user-friendly Gradio interface.
Jetbot Voice to Action Tools is a set of ROS2 nodes that utilize the Jetson Automatic Speech Recognition (ASR) deep learning interface library for NVIDIA Jetson
This project aims to assist visually impaired individuals by providing a solution to convert images into spoken language. Leveraging deep learning and natural language processing, the system processes images, generates descriptive captions, and converts these captions into audio output.
Its a Tool for creating announcement sound files from an excel file and an exported audio track. and generates an announcement for that in Hindi and English language to help people.
readLites makes reading fun and easy! It uses smart technology to show pictures and pick out important ideas from stories. This helps you understand and enjoy books better!
Repository to save the code for the software and hardware prototype for Lego Yoshi, a proposed expansion to the Lego Super Mario interactivity using OpenAI's GPT 3.5 Turbo API
This project uses YOLO v8 for real-time object detection in images, providing voice feedback on the detected objects' positions. Powered by the gTTS API, it offers seamless integration and enhanced user experience.