local 7891956d52 update on various python tools

2026-01-19 15:21:44 +00:00

..

initial

2026-01-13 16:47:04 +00:00

assistant.py

update on various python tools

2026-01-19 15:21:44 +00:00

README.md

add README

2026-01-14 00:34:39 +00:00

talk.sh

command line app STT, text to local LLM

2026-01-14 00:20:56 +00:00

terminal.sh

voice_to_terminal#1

2026-01-14 01:46:31 +00:00

voice_to_terminal.py

added feed backloop

2026-01-14 02:04:31 +00:00

README.md

Purpose

A speech-to-text command-line utility. Speech is transcribed locally with faster-whisper, and the resulting text is passed to a local LLM served by Ollama.
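The flow described above can be sketched roughly as follows. This is a minimal illustration, not the code in talk.sh or voice_to_terminal.py. The Ollama endpoint (its default local port), the model name `llama3`, the Whisper model size `base`, and the 16 kHz sample rate are all assumptions chosen for the example.

```python
# Sketch only: names and defaults below are assumptions, not taken from the repository.
OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint
SAMPLE_RATE = 16000  # Whisper models expect 16 kHz mono audio


def record(seconds: float):
    """Record mono float32 audio from the default microphone."""
    import sounddevice as sd  # imported lazily so the module loads without audio hardware

    frames = int(seconds * SAMPLE_RATE)
    audio = sd.rec(frames, samplerate=SAMPLE_RATE, channels=1, dtype="float32")
    sd.wait()  # block until the recording is finished
    return audio.flatten()


def transcribe(audio, model_size: str = "base") -> str:
    """Transcribe a float32 NumPy array with faster-whisper on the GPU."""
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cuda", compute_type="float16")
    segments, _info = model.transcribe(audio)
    return " ".join(segment.text.strip() for segment in segments)


def build_payload(prompt: str, model: str = "llama3") -> dict:
    """Build a non-streaming request body for Ollama's generate endpoint."""
    return {"model": model, "prompt": prompt, "stream": False}


def ask_ollama(prompt: str, model: str = "llama3") -> str:
    """Send the transcribed text to the local LLM and return its reply."""
    import requests

    reply = requests.post(OLLAMA_URL, json=build_payload(prompt, model), timeout=120)
    reply.raise_for_status()
    return reply.json()["response"]


if __name__ == "__main__":
    text = transcribe(record(5.0))
    print("You said:", text)
    print("LLM:", ask_ollama(text))
```

Running the script end to end needs a microphone, a CUDA-capable GPU, and an Ollama server already running with the chosen model pulled.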

Setup

# Create the environment with Python 3.10 and CUDA toolkit
mamba create -n whisper-ollama python=3.10 nvidia/label/cuda-12.2.0::cuda-toolkit cudnn -c nvidia -c conda-forge -y

# Activate the environment
mamba activate whisper-ollama

# Install Audio and Logic dependencies
# Note: portaudio is required for sounddevice to work on Linux
sudo apt-get update && sudo apt-get install libportaudio2 -y

pip install faster-whisper sounddevice numpy pyperclip requests
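
Once the packages are installed, a quick check like the one below can confirm that each one is visible inside the active environment. This is a sketch, not part of the repository. The helper name `check_dependencies` is hypothetical, and the module names are the import names that correspond to the pip packages above.

```python
from importlib.util import find_spec

# Import names for the pip packages listed above (faster-whisper imports as faster_whisper).
REQUIRED_MODULES = ["faster_whisper", "sounddevice", "numpy", "pyperclip", "requests"]


def check_dependencies(modules=REQUIRED_MODULES):
    """Return a dict mapping each module name to True if it can be located."""
    return {name: find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    for name, found in check_dependencies().items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

This only checks that the modules can be located; it does not import them. In particular, sounddevice may still fail at import time if the PortAudio library is not installed on the system.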