New tool that uses webrtcvad for voice activity detection, faster-whisper for transcription, and xdotool to type into any focused window. Supports session-based listening, a configurable silence threshold, and a "full stop" magic word to auto-submit.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
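The "full stop" magic word described above could be handled roughly as follows. This is only a sketch under stated assumptions, not the code in voice_to_xdotool.py: the function names `split_magic_word` and `xdotool_commands` are hypothetical, and it assumes the phrase only counts when it ends the utterance and that "auto-submit" means pressing Return after typing. The only external commands referenced are `xdotool type` and `xdotool key`.

```python
import re


def split_magic_word(transcript, magic="full stop"):
    """Return (text, submit); submit is True if the transcript ends with the magic phrase.

    Hypothetical helper: the real tool may match the phrase differently.
    """
    cleaned = transcript.strip()
    # Match the phrase only at the end, allowing trailing punctuation
    # that a transcriber may append (e.g. "full stop.").
    pattern = re.compile(
        r"[\s,]*\b" + re.escape(magic) + r"\b[\s.!?]*$", re.IGNORECASE
    )
    match = pattern.search(cleaned)
    if match is None:
        return cleaned, False
    return cleaned[: match.start()].rstrip(), True


def xdotool_commands(transcript):
    """Build the xdotool argument lists for one transcribed utterance."""
    text, submit = split_magic_word(transcript)
    commands = []
    if text:
        # Type the dictated text into the focused window.
        commands.append(["xdotool", "type", "--clearmodifiers", "--", text])
    if submit:
        # Assumption: auto-submit means sending the Return key.
        commands.append(["xdotool", "key", "Return"])
    return commands
```

Each returned list could be passed to `subprocess.run`. Building the commands separately from running them keeps the magic-word logic testable on a machine without an X session.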
7 lines
280 B
Bash
Executable File
#!/bin/bash

export CT2_CUDA_ALLOW_FP16=1

# 'mamba run' executes the command within the context of the environment
# without needing to source .bashrc or shell hooks manually.
mamba run -n whisper-ollama python ~/family-repo/Code/python/tool-speechtotext/voice_to_xdotool.py "$@"