-
Notifications
You must be signed in to change notification settings - Fork 21
Open
Labels
AdvancedComplex changes needing deep architectural understanding.Complex changes needing deep architectural understanding.SSoC25Part of Social Summer of Code 2025. Tag for recognition.Part of Social Summer of Code 2025. Tag for recognition.enhancementNew feature or requestNew feature or request
Description
Description:
Enable users to speak natural language commands using their microphone, which PromptShell will convert to text and then process as a query. This makes the tool more accessible, hands-free, and faster to use in some workflows.
Expected Behavior:
- Add a CLI flag:
--listen- PromptShell will:
- Activate the microphone
- Convert speech to text (via offline/online models)
- Show transcribed text as a normal query
- Send it to the LLM
- Display the result
Dependencies:
- Python library: speech_recognition
Test case example:
$ promptshell --listen
🎤 Listening... (press Ctrl+C to stop)
User said: "List all Python files modified in the last 2 days"
Response:
find . -name "*.py" -mtime -2Challenges
- Cross-platform microphone access
- Error handling if mic access fails
- Optional fallback to text if no speech detected
Benefits:
- Greatly improves accessibility
- Useful for hands-free use cases (e.g., DevOps, on-the-go scripting)
- Helps differently-abled users interact with the shell via voice
Metadata
Metadata
Assignees
Labels
AdvancedComplex changes needing deep architectural understanding.Complex changes needing deep architectural understanding.SSoC25Part of Social Summer of Code 2025. Tag for recognition.Part of Social Summer of Code 2025. Tag for recognition.enhancementNew feature or requestNew feature or request