Commit Graph

12 Commits

Author SHA1 Message Date
local
104da381fb Refactor tool-speechtotext: extract sttlib shared library and add tests
Extract duplicated code (Whisper loading, audio recording, transcription,
VAD processing) into reusable sttlib/ package. Rewrite all 3 scripts as
thin wrappers. Add 24 unit tests with mocked hardware. Fix GPU fallback
bug in assistant.py and args.system assignment bug.
2026-02-08 00:40:31 +00:00
local
848681087e Add voice-to-xdotool: hands-free speech typing via VAD + Whisper + xdotool
New tool that uses webrtcvad for voice activity detection, faster-whisper
for transcription, and xdotool to type into any focused window. Supports
session-based listening, configurable silence threshold, and a "full stop"
magic word to auto-submit.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-06 23:37:14 +00:00
local
370e97d08d tidyup 2026-01-19 23:07:59 +00:00
local
a5934e45b2 improved pdfcreator
1- use CLI
2- refactor code
2026-01-19 23:06:56 +00:00
local
6c4b78f274 update pdf creator
TOC, harmonize page size
2026-01-19 17:37:16 +00:00
local
7891956d52 update on various python tools 2026-01-19 15:21:44 +00:00
dl92
31684ecded added feed backloop 2026-01-14 02:04:31 +00:00
dl92
7f3ea89297 voice_to_terminal#1
automate terminal with voice
2026-01-14 01:46:31 +00:00
dl92
2a5347d1b9 add README 2026-01-14 00:34:39 +00:00
dl92
781659a552 command line app STT, text to local LLM 2026-01-14 00:20:56 +00:00
dl92
f09b390d90 v2 - works
amazing made a screecast out of this
2026-01-13 18:01:38 +00:00
dl92
33248895ff initial
gemini v1
2026-01-13 16:47:04 +00:00