Add voice-to-xdotool: hands-free speech typing via VAD + Whisper + xdotool
New tool that uses webrtcvad for voice activity detection, faster-whisper for transcription, and xdotool to type into any focused window. Supports session-based listening, configurable silence threshold, and a "full stop" magic word to auto-submit.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
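The "full stop" magic word described above could work roughly as in the sketch below. It is not taken from voice_to_xdotool.py, which this diff does not show. The helper name `plan_xdotool_commands`, the end-of-utterance matching rule, and the use of the Return key to submit are all assumptions made for illustration. The function only builds argument lists for xdotool, so it can be checked without an X session.

```python
import re

# The magic word comes from the commit message; how it is matched is an assumption.
MAGIC_WORD = "full stop"

# Assumed rule: the magic word counts only at the end of the utterance,
# ignoring case and any trailing punctuation the transcriber adds.
_MAGIC_RE = re.compile(
    r"[\s,]*\b" + re.escape(MAGIC_WORD) + r"[\s.!?]*$",
    re.IGNORECASE,
)


def plan_xdotool_commands(transcript: str) -> list[list[str]]:
    """Hypothetical helper: map one transcribed utterance to xdotool argv lists."""
    text = transcript.strip()
    match = _MAGIC_RE.search(text)
    submit = match is not None
    if submit:
        # Drop the magic word so it is not typed into the window.
        text = text[: match.start()]

    commands = []
    if text:
        # "--" stops option parsing, so text starting with "-" is typed literally.
        commands.append(["xdotool", "type", "--clearmodifiers", "--", text])
    if submit:
        # Assumption: "auto-submit" means pressing Return in the focused window.
        commands.append(["xdotool", "key", "Return"])
    return commands


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    # To execute for real, pass each list to subprocess.run(cmd, check=True).
    for cmd in plan_xdotool_commands("send the report full stop"):
        print(cmd)
```

Returning argv lists rather than shell strings sidesteps quoting problems with arbitrary transcribed text.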
6
python/tool-speechtotext/xdotool.sh
Executable file
@@ -0,0 +1,6 @@
#!/bin/bash
export CT2_CUDA_ALLOW_FP16=1

# 'mamba run' executes the command within the context of the environment
# without needing to source .bashrc or shell hooks manually.
mamba run -n whisper-ollama python ~/family-repo/Code/python/tool-speechtotext/voice_to_xdotool.py "$@"