2025 RealTimeTranscribeAI 
GitHub Link: https://github.com/MrPumpkinsss/RealTimeTranscribeAI
RealTimeTranscribeAI
RealTimeTranscribeAI is an enhanced version of the original Ecoute project with an added AI reply feature. This tool simultaneously captures audio from your microphone (user input) and speakers (system output), transcribes it in real time, and uses the Deepseek API to generate professional AI responses.
Features
Dual Audio Capture
Record audio concurrently from both the microphone and the speakers.- Real-Time Transcription
Choose between two transcription modes:- Local Transcription: Powered by the Faster Whisper model.
- API Transcription: Uses OpenAI’s transcription API.
AI-Powered Reply Generation
Generate concise, interview-style responses based on the transcribed text using the Deepseek API.User-Friendly Graphical Interface
The CustomTkinter-based GUI displays live transcriptions, allows you to clear the transcript, and shows AI-generated replies.- Automatic Ambient Noise Adjustment
Automatically calibrates for ambient noise to enhance transcription accuracy.
Table of Contents
Installation
System Requirements
- Python Version: Python 3.8 or higher.
- Operating System: Primarily designed for Windows (speaker capture utilizes WASAPI loopback devices; other platforms may require additional configuration).
- External Tools:
FFmpeg — Ensure theffmpegcommand is available in your system PATH for proper audio processing.
Python Dependencies
Install the required Python packages using pip:
pip install -r requirements.txt
Deepseek API Configuration
Before running RealTimeTranscribeAI, edit the following lines in the main.py file to set up your Deepseek API details:
api_key = "Your_Deepseek_API_Key"
base_url = "https://api.deepseek.com"
chat_model = "deepseek-chat"
Replace "Your_Deepseek_API_Key" with your actual Deepseek API key. You can also modify the INITIAL_DEEPSEEK_PROMPT variable to customize the initial prompt for AI-generated replies.
Usage
- Complete the installation and API configuration steps.
- Launch the application with the following command:
python main.py - Once started, the graphical interface will display the live transcription and AI responses. Use the interface buttons to clear the transcript or adjust settings as needed.
