add README

2026-01-14 00:34:39 +00:00
parent 781659a552
commit 2a5347d1b9
1 changed files with 19 additions and 0 deletions
--- a/python/tool-speechtotext/README.md
+++ b/python/tool-speechtotext/README.md
@@ -0,0 +1,19 @@
+
+# Purpose
+speech to text command line utility by leveraging off ollama a local speech-to-text model
+
+## Setup
+
+```bash
+# Create the environment with Python 3.10 and CUDA toolkit
+mamba create -n whisper-ollama python=3.10 nvidia/label/cuda-12.2.0::cuda-toolkit cudnn -c nvidia -c conda-forge -y
+
+# Activate the environment
+mamba activate whisper-ollama
+
+# Install Audio and Logic dependencies
+# Note: portaudio is required for sounddevice to work on Linux
+sudo apt-get update && sudo apt-get install libportaudio2 -y
+
+pip install faster-whisper sounddevice numpy pyperclip requests
+```