Compare commits
2 Commits: 848681087e ... 2e8c2c11d0

Commits in range:
- 2e8c2c11d0
- 104da381fb
38  python/persian-tutor/CLAUDE.md  Normal file
@@ -0,0 +1,38 @@
# Persian Language Tutor

## Overview

Gradio-based Persian (Farsi) language learning app for English speakers, using GCSE Persian vocabulary (Pearson spec) as seed data.

## Tech Stack

- **Frontend**: Gradio (browser handles RTL natively)
- **Spaced repetition**: py-fsrs (same algorithm as Anki)
- **AI**: Ollama (fast, local) + Claude CLI (smart, via subprocess)
- **STT**: faster-whisper via sttlib from tool-speechtotext
- **Anki export**: genanki for .apkg generation
- **Database**: SQLite (file-based, data/progress.db)
- **Environment**: `whisper-ollama` conda env

## Running

```bash
mamba run -n whisper-ollama python app.py
```

## Testing

```bash
mamba run -n whisper-ollama python -m pytest tests/
```

## Key Paths

- `data/vocabulary.json` — GCSE vocabulary data
- `data/progress.db` — SQLite database (auto-created)
- `app.py` — Gradio entry point
- `db.py` — Database layer with FSRS integration
- `ai.py` — Dual AI backend (Ollama + Claude)
- `stt.py` — Persian speech-to-text wrapper
- `modules/` — Feature modules (vocab, dashboard, essay, tutor, idioms)

## Architecture

- Single-process Gradio app with a shared SQLite connection
- FSRS Card objects serialized as JSON in SQLite TEXT columns
- Timestamps stored as ISO-8601 strings
- sttlib imported via sys.path from the tool-speechtotext project
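The storage approach described above (FSRS card state as a JSON blob in a TEXT column, timestamps as ISO-8601 strings) can be sketched with nothing but the standard library. This is an illustrative sketch, not the project's actual `db.py`; column names mirror the schema used there:

```python
import json
import sqlite3
from datetime import datetime, timezone

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE word_progress (word_id TEXT PRIMARY KEY, fsrs_state TEXT, due TIMESTAMP)"
)

# A card's scheduling state is stored as one JSON blob; datetimes become ISO-8601 strings.
card_state = {
    "stability": 2.5,
    "difficulty": 5.0,
    "due": datetime.now(timezone.utc).isoformat(),
}
conn.execute(
    "INSERT INTO word_progress VALUES (?, ?, ?)",
    ("salam", json.dumps(card_state), card_state["due"]),
)

# Reading it back is a plain json.loads; ISO-8601 strings also compare
# correctly as text, which is why `WHERE due <= ?` queries work.
row = conn.execute(
    "SELECT fsrs_state FROM word_progress WHERE word_id = ?", ("salam",)
).fetchone()
restored = json.loads(row[0])
```

The payoff of ISO-8601 text timestamps is that due-date queries need no date parsing on the SQL side.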
57  python/persian-tutor/README.md  Normal file
@@ -0,0 +1,57 @@
# Persian Language Tutor

A Gradio-based Persian (Farsi) language learning app for English speakers, built around GCSE Persian vocabulary (Pearson specification).

## Features

- **Vocabulary Study** — Search, browse, and study 918 GCSE Persian words across 39 categories
- **Flashcards with FSRS** — Spaced repetition scheduling (same algorithm as Anki)
- **Idioms & Expressions** — 25 Persian social conventions with cultural context
- **AI Tutor** — Conversational Persian lessons by GCSE theme (via Ollama)
- **Essay Marking** — Write Persian essays, get AI feedback and grading (via Claude)
- **Dashboard** — Track progress, streaks, and mastery
- **Anki Export** — Generate .apkg decks for offline study
- **Voice Input** — Speak Persian via microphone (Whisper STT) in the Tutor tab

## Prerequisites

- `whisper-ollama` conda environment with Python 3.10+
- Ollama running locally with `qwen2.5:7b` (or another model)
- Claude CLI installed (for essay marking / smart mode)

## Setup

```bash
/home/ys/miniforge3/envs/whisper-ollama/bin/pip install gradio genanki fsrs
```

## Running the app

```bash
cd /home/ys/family-repo/Code/python/persian-tutor
/home/ys/miniforge3/envs/whisper-ollama/bin/python app.py
```

Then open http://localhost:7860 in your browser.

## Running tests

```bash
cd /home/ys/family-repo/Code/python/persian-tutor
/home/ys/miniforge3/envs/whisper-ollama/bin/python -m pytest tests/ -v
```

41 tests covering the db, vocab, ai, and anki_export modules.

## Expanding vocabulary

The vocabulary can be expanded by editing `data/vocabulary.json` directly, or by updating `scripts/build_vocab.py` and re-running it:

```bash
/home/ys/miniforge3/envs/whisper-ollama/bin/python scripts/build_vocab.py
```

## TODO

- [ ] Voice-based vocabulary testing — answer flashcard prompts by speaking Persian
- [ ] Improved UI theme and layout polish
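The exact schema of `data/vocabulary.json` is not shown in this diff (the file is suppressed as too large). Judging from the fields the rest of the app reads (`english`, `persian`, `finglish`, `category`, `id`), an entry plausibly looks like this hypothetical sketch; the real file may carry extra keys:

```python
import json

# Hypothetical entry shape, inferred from the fields used by vocab search,
# flashcards, and anki_export. The real vocabulary.json may differ.
entry = {
    "id": "greeting-001",     # stable ID, also seeds the Anki note GUID
    "english": "hello",
    "persian": "سلام",
    "finglish": "salâm",      # romanized transliteration
    "category": "Greetings",
}

# Round-trips cleanly through JSON, Persian text included.
restored = json.loads(json.dumps(entry, ensure_ascii=False))
```

Anything added to the file by hand or by `scripts/build_vocab.py` should at minimum carry these fields, since the flashcard and export code reads them directly.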
44  python/persian-tutor/ai.py  Normal file
@@ -0,0 +1,44 @@
"""Dual AI backend: Ollama (fast/local) and Claude CLI (smart)."""

import subprocess

import ollama

DEFAULT_OLLAMA_MODEL = "qwen2.5:7b"


def ask_ollama(prompt, system=None, model=DEFAULT_OLLAMA_MODEL):
    """Query Ollama with an optional system prompt."""
    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    messages.append({"role": "user", "content": prompt})
    response = ollama.chat(model=model, messages=messages)
    return response.message.content


def ask_claude(prompt):
    """Query Claude via the CLI subprocess."""
    result = subprocess.run(
        ["claude", "-p", prompt],
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


def ask(prompt, system=None, quality="fast"):
    """Unified interface. quality='fast' uses Ollama, 'smart' uses Claude."""
    if quality == "smart":
        return ask_claude(prompt)
    return ask_ollama(prompt, system=system)


def chat_ollama(messages, system=None, model=DEFAULT_OLLAMA_MODEL):
    """Multi-turn conversation with Ollama."""
    all_messages = []
    if system:
        all_messages.append({"role": "system", "content": system})
    all_messages.extend(messages)
    response = ollama.chat(model=model, messages=all_messages)
    return response.message.content
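The `ask()` dispatch above is easy to exercise without either backend running. This sketch reimplements just the routing logic with stub backends; the stub functions are invented for illustration and are not part of ai.py:

```python
# Stand-ins for ask_ollama / ask_claude so the routing runs offline.
def fake_ollama(prompt, system=None):
    return f"[ollama] {prompt}"

def fake_claude(prompt):
    return f"[claude] {prompt}"

def ask(prompt, system=None, quality="fast"):
    """Same shape as ai.ask(): 'smart' routes to Claude, anything else to Ollama."""
    if quality == "smart":
        return fake_claude(prompt)
    return fake_ollama(prompt, system=system)

print(ask("Translate 'hello'"))                 # routed to the fast backend
print(ask("Mark this essay", quality="smart"))  # routed to the smart backend
```

In the real module the same split keeps latency-sensitive calls (tutor chat, quick lookups) on the local model while reserving the Claude subprocess for essay marking.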
76  python/persian-tutor/anki_export.py  Normal file
@@ -0,0 +1,76 @@
"""Generate Anki .apkg decks from vocabulary data."""

import genanki

# Stable model/deck IDs (generated once, kept constant)
_MODEL_ID = 1607392319
_DECK_ID = 2059400110


def _make_model():
    """Create an Anki note model with two card templates."""
    return genanki.Model(
        _MODEL_ID,
        "GCSE Persian",
        fields=[
            {"name": "English"},
            {"name": "Persian"},
            {"name": "Finglish"},
            {"name": "Category"},
        ],
        templates=[
            {
                "name": "English → Persian",
                "qfmt": '<div style="font-size:1.5em">{{English}}</div>'
                '<br><small>{{Category}}</small>',
                "afmt": '{{FrontSide}}<hr id="answer">'
                '<div dir="rtl" style="font-size:2em">{{Persian}}</div>'
                "<br><div>{{Finglish}}</div>",
            },
            {
                "name": "Persian → English",
                "qfmt": '<div dir="rtl" style="font-size:2em">{{Persian}}</div>'
                '<br><small>{{Category}}</small>',
                "afmt": '{{FrontSide}}<hr id="answer">'
                '<div style="font-size:1.5em">{{English}}</div>'
                "<br><div>{{Finglish}}</div>",
            },
        ],
        css=".card { font-family: arial; text-align: center; }",
    )


def export_deck(vocab, categories=None, output_path="gcse-persian.apkg"):
    """Generate an Anki .apkg deck from vocabulary entries.

    Args:
        vocab: List of vocabulary entries (dicts with english, persian, finglish, category).
        categories: Optional list of categories to include. None = all.
        output_path: Where to save the .apkg file.

    Returns:
        Path to the generated .apkg file.
    """
    model = _make_model()
    deck = genanki.Deck(_DECK_ID, "GCSE Persian")

    for entry in vocab:
        if categories and entry.get("category") not in categories:
            continue

        note = genanki.Note(
            model=model,
            fields=[
                entry.get("english", ""),
                entry.get("persian", ""),
                entry.get("finglish", ""),
                entry.get("category", ""),
            ],
            guid=genanki.guid_for(entry.get("id", entry["english"])),
        )
        deck.add_note(note)

    package = genanki.Package(deck)
    package.write_to_file(output_path)
    return output_path
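The `guid=genanki.guid_for(...)` call above is what makes repeated exports safe: a note's GUID is a deterministic function of its vocabulary ID, so importing a regenerated deck updates existing notes instead of duplicating them. The idea can be sketched without genanki (the hash used here is illustrative, not genanki's actual scheme):

```python
import hashlib

def stable_guid(note_id: str) -> str:
    # Illustrative stand-in for genanki.guid_for(): any deterministic hash of
    # the vocabulary ID works, as long as it never changes between exports.
    return hashlib.sha256(note_id.encode("utf-8")).hexdigest()[:10]

# The same ID always yields the same GUID, across runs and machines...
assert stable_guid("greeting-001") == stable_guid("greeting-001")
# ...while different IDs get different GUIDs.
assert stable_guid("greeting-001") != stable_guid("greeting-002")
```

This is also why `export_deck` falls back to `entry["english"]` only when an entry has no `id`: renaming the English gloss of an ID-less entry would change its GUID and create a duplicate note on the next import.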
511  python/persian-tutor/app.py  Normal file
@@ -0,0 +1,511 @@
"""Persian Language Tutor — Gradio UI."""

import json
import os
import tempfile
import time

import gradio as gr

import db
from modules import vocab, dashboard, essay, tutor, idioms
from modules.essay import GCSE_THEMES
from modules.tutor import THEME_PROMPTS
from anki_export import export_deck


# ---------- Initialise ----------
db.init_db()
vocabulary = vocab.load_vocab()
categories = ["All"] + vocab.get_categories()


# ---------- Helper ----------
def _rtl(text, size="2em"):
    return f'<div dir="rtl" style="font-size:{size}; text-align:center">{text}</div>'


# ================================================================
# TAB HANDLERS
# ================================================================

# ---------- Dashboard ----------
def refresh_dashboard():
    overview_md = dashboard.format_overview_markdown()
    cat_data = dashboard.get_category_breakdown()
    quiz_data = dashboard.get_recent_quizzes()
    return overview_md, cat_data, quiz_data


# ---------- Vocabulary Search ----------
def do_search(query, category):
    results = vocab.search(query)
    if category and category != "All":
        results = [r for r in results if r["category"] == category]
    if not results:
        return "No results found."
    lines = []
    for r in results:
        status = vocab.get_word_status(r["id"])
        icon = {"new": "⬜", "learning": "🟨", "mastered": "🟩"}.get(status, "⬜")
        lines.append(
            f'{icon} **{r["english"]}** — '
            f'<span dir="rtl">{r["persian"]}</span>'
            f' ({r.get("finglish", "")})'
        )
    return "\n\n".join(lines)


def do_random_word(category, transliteration):
    entry = vocab.get_random_word(category=category)
    if not entry:
        return "No words found."
    return vocab.format_word_card(entry, show_transliteration=transliteration)


# ---------- Flashcards ----------
def start_flashcards(category, direction):
    batch = vocab.get_flashcard_batch(count=10, category=category)
    if not batch:
        return "No words available.", [], 0, 0, "", gr.update(visible=False)

    first = batch[0]
    if direction == "English → Persian":
        prompt = f'<div style="font-size:2em; text-align:center">{first["english"]}</div>'
    else:
        prompt = _rtl(first["persian"])

    return (
        prompt,                   # card_display
        batch,                    # batch state
        0,                        # current index
        0,                        # score
        "",                       # answer_box cleared
        gr.update(visible=True),  # answer_area visible
    )


def submit_answer(user_answer, batch, index, score, direction, transliteration):
    if not batch or index >= len(batch):
        return "Session complete!", batch, index, score, "", gr.update(visible=False), ""

    entry = batch[index]
    dir_key = "en_to_fa" if direction == "English → Persian" else "fa_to_en"
    is_correct, correct_answer, _ = vocab.check_answer(entry["id"], user_answer, direction=dir_key)

    if is_correct:
        score += 1
        result = "✅ **Correct!**"
    else:
        result = "❌ **Incorrect.** The answer is: "
        if dir_key == "en_to_fa":
            result += f'<span dir="rtl">{correct_answer}</span>'
        else:
            result += correct_answer

    card_info = vocab.format_word_card(entry, show_transliteration=transliteration)
    feedback = f"{result}\n\n{card_info}\n\n---\n*Rate your recall to continue:*"

    return feedback, batch, index, score, "", gr.update(visible=True), ""


def rate_and_next(rating_str, batch, index, score, direction):
    if not batch or index >= len(batch):
        return "Session complete!", batch, index, score, gr.update(visible=False)

    import fsrs as fsrs_mod
    rating_map = {
        "Again": fsrs_mod.Rating.Again,
        "Hard": fsrs_mod.Rating.Hard,
        "Good": fsrs_mod.Rating.Good,
        "Easy": fsrs_mod.Rating.Easy,
    }
    rating = rating_map.get(rating_str, fsrs_mod.Rating.Good)
    entry = batch[index]
    db.update_word_progress(entry["id"], rating)

    index += 1
    if index >= len(batch):
        summary = f"## Session Complete!\n\n**Score:** {score}/{len(batch)}\n\n"
        summary += f"**Accuracy:** {score/len(batch)*100:.0f}%"
        return summary, batch, index, score, gr.update(visible=False)

    next_entry = batch[index]
    if direction == "English → Persian":
        prompt = f'<div style="font-size:2em; text-align:center">{next_entry["english"]}</div>'
    else:
        prompt = _rtl(next_entry["persian"])

    return prompt, batch, index, score, gr.update(visible=True)
# ---------- Idioms ----------
def show_random_idiom(transliteration):
    expr = idioms.get_random_expression()
    return idioms.format_expression(expr, show_transliteration=transliteration), expr


def explain_idiom(expr_state):
    if not expr_state:
        return "Pick an idiom first."
    return idioms.explain_expression(expr_state)


def browse_idioms(transliteration):
    exprs = idioms.get_all_expressions()
    lines = []
    for e in exprs:
        line = f'**<span dir="rtl">{e["persian"]}</span>** — {e["english"]}'
        if transliteration != "off":
            line += f' *({e["finglish"]})*'
        lines.append(line)
    return "\n\n".join(lines)


# ---------- Tutor ----------
def start_tutor_lesson(theme):
    response, messages, system = tutor.start_lesson(theme)
    chat_history = [{"role": "assistant", "content": response}]
    return chat_history, messages, system, time.time()


def send_tutor_message(user_msg, chat_history, messages, system, audio_input):
    # Use STT if audio provided and no text
    if audio_input is not None and (not user_msg or not user_msg.strip()):
        try:
            from stt import transcribe_persian
            user_msg = transcribe_persian(audio_input)
        except Exception:
            user_msg = ""

    if not user_msg or not user_msg.strip():
        return chat_history, messages, "", None

    response, messages = tutor.process_response(user_msg, messages, system=system)
    chat_history.append({"role": "user", "content": user_msg})
    chat_history.append({"role": "assistant", "content": response})
    return chat_history, messages, "", None


def save_tutor(theme, messages, start_time):
    if messages and len(messages) > 1:
        tutor.save_session(theme, messages, start_time)
        return "Session saved!"
    return "Nothing to save."


# ---------- Essay ----------
def submit_essay(text, theme):
    if not text or not text.strip():
        return "Please write an essay first."
    return essay.mark_essay(text, theme)


def load_essay_history():
    return essay.get_essay_history()


# ---------- Settings / Export ----------
def do_anki_export(cats_selected):
    v = vocab.load_vocab()
    cats = cats_selected if cats_selected else None
    path = os.path.join(tempfile.gettempdir(), "gcse-persian.apkg")
    export_deck(v, categories=cats, output_path=path)
    return path


def reset_progress():
    conn = db.get_connection()
    conn.execute("DELETE FROM word_progress")
    conn.execute("DELETE FROM quiz_sessions")
    conn.execute("DELETE FROM essays")
    conn.execute("DELETE FROM tutor_sessions")
    conn.commit()
    return "Progress reset."


# ================================================================
# GRADIO UI
# ================================================================
with gr.Blocks(title="Persian Language Tutor", theme=gr.themes.Soft()) as app:
    gr.Markdown("# 🇮🇷 Persian Language Tutor\n*GCSE Persian vocabulary with spaced repetition*")

    # Shared state
    transliteration_state = gr.State(value="Finglish")

    with gr.Tabs():
        # ==================== DASHBOARD ====================
        with gr.Tab("📊 Dashboard"):
            overview_md = gr.Markdown("Loading...")
            with gr.Row():
                cat_table = gr.Dataframe(
                    headers=["Category", "Total", "Seen", "Mastered", "Progress"],
                    label="Category Breakdown",
                )
                quiz_table = gr.Dataframe(
                    headers=["Date", "Category", "Score", "Duration"],
                    label="Recent Quizzes",
                )
            refresh_btn = gr.Button("Refresh", variant="secondary")
            refresh_btn.click(
                fn=refresh_dashboard,
                outputs=[overview_md, cat_table, quiz_table],
            )

        # ==================== VOCABULARY ====================
        with gr.Tab("📚 Vocabulary"):
            with gr.Row():
                search_box = gr.Textbox(
                    label="Search (English or Persian)",
                    placeholder="Type to search...",
                )
                vocab_cat = gr.Dropdown(
                    choices=categories, value="All", label="Category"
                )
            search_btn = gr.Button("Search", variant="primary")
            random_btn = gr.Button("Random Word")
            search_results = gr.Markdown("Search for a word above.")

            search_btn.click(
                fn=do_search,
                inputs=[search_box, vocab_cat],
                outputs=[search_results],
            )
            search_box.submit(
                fn=do_search,
                inputs=[search_box, vocab_cat],
                outputs=[search_results],
            )
            random_btn.click(
                fn=do_random_word,
                inputs=[vocab_cat, transliteration_state],
                outputs=[search_results],
            )

        # ==================== FLASHCARDS ====================
        with gr.Tab("🃏 Flashcards"):
            with gr.Row():
                fc_category = gr.Dropdown(
                    choices=categories, value="All", label="Category"
                )
                fc_direction = gr.Radio(
                    ["English → Persian", "Persian → English"],
                    value="English → Persian",
                    label="Direction",
                )
            start_fc_btn = gr.Button("Start Session", variant="primary")

            card_display = gr.Markdown("Press 'Start Session' to begin.")

            # Hidden states
            fc_batch = gr.State([])
            fc_index = gr.State(0)
            fc_score = gr.State(0)

            with gr.Group(visible=False) as answer_area:
                answer_box = gr.Textbox(
                    label="Your answer",
                    placeholder="Type your answer...",
                    rtl=True,
                )
                submit_ans_btn = gr.Button("Submit Answer", variant="primary")
                answer_feedback = gr.Markdown("")

                with gr.Row():
                    btn_again = gr.Button("Again", variant="stop")
                    btn_hard = gr.Button("Hard", variant="secondary")
                    btn_good = gr.Button("Good", variant="primary")
                    btn_easy = gr.Button("Easy", variant="secondary")

            start_fc_btn.click(
                fn=start_flashcards,
                inputs=[fc_category, fc_direction],
                outputs=[card_display, fc_batch, fc_index, fc_score, answer_box, answer_area],
            )

            submit_ans_btn.click(
                fn=submit_answer,
                inputs=[answer_box, fc_batch, fc_index, fc_score, fc_direction, transliteration_state],
                outputs=[card_display, fc_batch, fc_index, fc_score, answer_box, answer_area, answer_feedback],
            )
            answer_box.submit(
                fn=submit_answer,
                inputs=[answer_box, fc_batch, fc_index, fc_score, fc_direction, transliteration_state],
                outputs=[card_display, fc_batch, fc_index, fc_score, answer_box, answer_area, answer_feedback],
            )

            for btn, label in [(btn_again, "Again"), (btn_hard, "Hard"), (btn_good, "Good"), (btn_easy, "Easy")]:
                btn.click(
                    fn=rate_and_next,
                    inputs=[gr.State(label), fc_batch, fc_index, fc_score, fc_direction],
                    outputs=[card_display, fc_batch, fc_index, fc_score, answer_area],
                )

        # ==================== IDIOMS ====================
        with gr.Tab("💬 Idioms & Expressions"):
            idiom_display = gr.Markdown("Click 'Random Idiom' or browse below.")
            idiom_state = gr.State(None)

            with gr.Row():
                random_idiom_btn = gr.Button("Random Idiom", variant="primary")
                explain_idiom_btn = gr.Button("Explain Usage")
                browse_idiom_btn = gr.Button("Browse All")

            idiom_explanation = gr.Markdown("")

            random_idiom_btn.click(
                fn=show_random_idiom,
                inputs=[transliteration_state],
                outputs=[idiom_display, idiom_state],
            )
            explain_idiom_btn.click(
                fn=explain_idiom,
                inputs=[idiom_state],
                outputs=[idiom_explanation],
            )
            browse_idiom_btn.click(
                fn=browse_idioms,
                inputs=[transliteration_state],
                outputs=[idiom_display],
            )

        # ==================== TUTOR ====================
        with gr.Tab("🎓 Tutor"):
            tutor_theme = gr.Dropdown(
                choices=list(THEME_PROMPTS.keys()),
                value="Identity and culture",
                label="Theme",
            )
            start_lesson_btn = gr.Button("New Lesson", variant="primary")

            # type="messages" matches the {"role": ..., "content": ...} dicts
            # the tutor handlers build.
            chatbot = gr.Chatbot(label="Conversation", type="messages")

            # Tutor states
            tutor_messages = gr.State([])
            tutor_system = gr.State("")
            tutor_start_time = gr.State(0)

            with gr.Row():
                tutor_input = gr.Textbox(
                    label="Your message",
                    placeholder="Type in English or Persian...",
                    scale=3,
                )
                tutor_mic = gr.Audio(
                    sources=["microphone"],
                    type="numpy",
                    label="Speak",
                    scale=1,
                )
            send_btn = gr.Button("Send", variant="primary")
            save_btn = gr.Button("Save Session", variant="secondary")
            save_status = gr.Markdown("")

            start_lesson_btn.click(
                fn=start_tutor_lesson,
                inputs=[tutor_theme],
                outputs=[chatbot, tutor_messages, tutor_system, tutor_start_time],
            )

            send_btn.click(
                fn=send_tutor_message,
                inputs=[tutor_input, chatbot, tutor_messages, tutor_system, tutor_mic],
                outputs=[chatbot, tutor_messages, tutor_input, tutor_mic],
            )
            tutor_input.submit(
                fn=send_tutor_message,
                inputs=[tutor_input, chatbot, tutor_messages, tutor_system, tutor_mic],
                outputs=[chatbot, tutor_messages, tutor_input, tutor_mic],
            )

            save_btn.click(
                fn=save_tutor,
                inputs=[tutor_theme, tutor_messages, tutor_start_time],
                outputs=[save_status],
            )

        # ==================== ESSAY ====================
        with gr.Tab("✍️ Essay"):
            essay_theme = gr.Dropdown(
                choices=GCSE_THEMES,
                value="Identity and culture",
                label="Theme",
            )
            essay_input = gr.Textbox(
                label="Write your essay in Persian",
                lines=10,
                rtl=True,
                placeholder="اینجا بنویسید...",
            )
            submit_essay_btn = gr.Button("Submit for Marking", variant="primary")
            essay_feedback = gr.Markdown("Write an essay and submit for AI marking.")

            gr.Markdown("### Essay History")
            essay_history_table = gr.Dataframe(
                headers=["Date", "Theme", "Grade", "Preview"],
                label="Past Essays",
            )
            refresh_essays_btn = gr.Button("Refresh History")

            submit_essay_btn.click(
                fn=submit_essay,
                inputs=[essay_input, essay_theme],
                outputs=[essay_feedback],
            )
            refresh_essays_btn.click(
                fn=load_essay_history,
                outputs=[essay_history_table],
            )

        # ==================== SETTINGS ====================
        with gr.Tab("⚙️ Settings"):
            gr.Markdown("## Settings")

            transliteration_radio = gr.Radio(
                ["off", "Finglish", "Academic"],
                value="Finglish",
                label="Transliteration",
            )

            ollama_model = gr.Textbox(
                label="Ollama Model",
                value="qwen2.5:7b",
                info="Model used for fast AI responses",
            )

            whisper_size = gr.Dropdown(
                choices=["tiny", "base", "small", "medium", "large-v3"],
                value="medium",
                label="Whisper Model Size",
            )

            gr.Markdown("### Anki Export")
            export_cats = gr.Dropdown(
                choices=vocab.get_categories(),
                multiselect=True,
                label="Categories to export (empty = all)",
            )
            export_btn = gr.Button("Export to Anki (.apkg)", variant="primary")
            export_file = gr.File(label="Download")

            export_btn.click(fn=do_anki_export, inputs=[export_cats], outputs=[export_file])

            gr.Markdown("### Reset")
            reset_btn = gr.Button("Reset All Progress", variant="stop")
            reset_status = gr.Markdown("")
            reset_btn.click(fn=reset_progress, outputs=[reset_status])

    # Wire transliteration state
    transliteration_radio.change(
        fn=lambda x: x,
        inputs=[transliteration_radio],
        outputs=[transliteration_state],
    )

    # Load dashboard on app start
    app.load(fn=refresh_dashboard, outputs=[overview_md, cat_table, quiz_table])


if __name__ == "__main__":
    app.launch()
7346  python/persian-tutor/data/vocabulary.json  Normal file
File diff suppressed because it is too large
234  python/persian-tutor/db.py  Normal file
@@ -0,0 +1,234 @@
"""SQLite database layer with FSRS spaced repetition integration."""

import json
import sqlite3
from datetime import datetime, timezone
from pathlib import Path

import fsrs

DB_PATH = Path(__file__).parent / "data" / "progress.db"

_conn = None
_scheduler = fsrs.Scheduler()


def get_connection():
    """Return the shared SQLite connection (singleton)."""
    global _conn
    if _conn is None:
        DB_PATH.parent.mkdir(parents=True, exist_ok=True)
        _conn = sqlite3.connect(str(DB_PATH), check_same_thread=False)
        _conn.row_factory = sqlite3.Row
        _conn.execute("PRAGMA journal_mode=WAL")
    return _conn


def init_db():
    """Create all tables if they don't exist. Called once at startup."""
    conn = get_connection()
    conn.executescript("""
        CREATE TABLE IF NOT EXISTS word_progress (
            word_id TEXT PRIMARY KEY,
            fsrs_state TEXT,
            due TIMESTAMP,
            stability REAL,
            difficulty REAL,
            reps INTEGER DEFAULT 0,
            lapses INTEGER DEFAULT 0,
            last_review TIMESTAMP
        );

        CREATE TABLE IF NOT EXISTS quiz_sessions (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
            category TEXT,
            total_questions INTEGER,
            correct INTEGER,
            duration_seconds INTEGER
        );

        CREATE TABLE IF NOT EXISTS essays (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
            essay_text TEXT,
            grade TEXT,
            feedback TEXT,
            theme TEXT
        );

        CREATE TABLE IF NOT EXISTS tutor_sessions (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
            theme TEXT,
            messages TEXT,
            duration_seconds INTEGER
        );
    """)
    conn.commit()


def get_word_progress(word_id):
    """Return learning state for one word, or None if never reviewed."""
    conn = get_connection()
    row = conn.execute(
        "SELECT * FROM word_progress WHERE word_id = ?", (word_id,)
    ).fetchone()
    return dict(row) if row else None


def update_word_progress(word_id, rating):
    """Run FSRS algorithm, update due date/stability/difficulty.

    Args:
        word_id: Vocabulary entry ID.
        rating: fsrs.Rating value (Again=1, Hard=2, Good=3, Easy=4).
    """
    conn = get_connection()
    existing = get_word_progress(word_id)

    if existing and existing["fsrs_state"]:
        card = fsrs.Card.from_dict(json.loads(existing["fsrs_state"]))
    else:
        card = fsrs.Card()

    card, review_log = _scheduler.review_card(card, rating)

    now = datetime.now(timezone.utc).isoformat()
    card_json = json.dumps(card.to_dict(), default=str)

    # Count a lapse when a previously seen word is forgotten (rated Again).
    lapses = existing["lapses"] if existing else 0
    if existing and rating == fsrs.Rating.Again:
        lapses += 1

    conn.execute(
        """INSERT OR REPLACE INTO word_progress
           (word_id, fsrs_state, due, stability, difficulty, reps, lapses, last_review)
           VALUES (?, ?, ?, ?, ?, ?, ?, ?)""",
        (
            word_id,
            card_json,
            card.due.isoformat(),
            card.stability,
            card.difficulty,
            (existing["reps"] + 1) if existing else 1,
            lapses,
            now,
        ),
    )
    conn.commit()
    return card
||||||
|
|
||||||
|
|
||||||
|
def get_due_words(limit=20):
|
||||||
|
"""Return word IDs where due <= now, ordered by due date."""
|
||||||
|
conn = get_connection()
|
||||||
|
now = datetime.now(timezone.utc).isoformat()
|
||||||
|
rows = conn.execute(
|
||||||
|
"SELECT word_id FROM word_progress WHERE due <= ? ORDER BY due LIMIT ?",
|
||||||
|
(now, limit),
|
||||||
|
).fetchall()
|
||||||
|
return [row["word_id"] for row in rows]
|
||||||
|
|
||||||
|
|
||||||
|
def get_word_counts(total_vocab_size=0):
|
||||||
|
"""Return dict with total/seen/mastered/due counts for dashboard."""
|
||||||
|
conn = get_connection()
|
||||||
|
now = datetime.now(timezone.utc).isoformat()
|
||||||
|
|
||||||
|
seen = conn.execute("SELECT COUNT(*) FROM word_progress").fetchone()[0]
|
||||||
|
mastered = conn.execute(
|
||||||
|
"SELECT COUNT(*) FROM word_progress WHERE stability > 10"
|
||||||
|
).fetchone()[0]
|
||||||
|
due = conn.execute(
|
||||||
|
"SELECT COUNT(*) FROM word_progress WHERE due <= ?", (now,)
|
||||||
|
).fetchone()[0]
|
||||||
|
|
||||||
|
return {
|
||||||
|
"total": total_vocab_size,
|
||||||
|
"seen": seen,
|
||||||
|
"mastered": mastered,
|
||||||
|
"due": due,
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
|
def record_quiz_session(category, total_questions, correct, duration_seconds):
|
||||||
|
"""Log a completed flashcard session."""
|
||||||
|
conn = get_connection()
|
||||||
|
conn.execute(
|
||||||
|
"INSERT INTO quiz_sessions (category, total_questions, correct, duration_seconds) VALUES (?, ?, ?, ?)",
|
||||||
|
(category, total_questions, correct, duration_seconds),
|
||||||
|
)
|
||||||
|
conn.commit()
|
||||||
|
|
||||||
|
|
||||||
|
def save_essay(essay_text, grade, feedback, theme):
|
||||||
|
"""Save an essay + AI feedback."""
|
||||||
|
conn = get_connection()
|
||||||
|
conn.execute(
|
||||||
|
"INSERT INTO essays (essay_text, grade, feedback, theme) VALUES (?, ?, ?, ?)",
|
||||||
|
(essay_text, grade, feedback, theme),
|
||||||
|
)
|
||||||
|
conn.commit()
|
||||||
|
|
||||||
|
|
||||||
|
def save_tutor_session(theme, messages, duration_seconds):
|
||||||
|
"""Save a tutor conversation."""
|
||||||
|
conn = get_connection()
|
||||||
|
conn.execute(
|
||||||
|
"INSERT INTO tutor_sessions (theme, messages, duration_seconds) VALUES (?, ?, ?)",
|
||||||
|
(theme, json.dumps(messages, ensure_ascii=False), duration_seconds),
|
||||||
|
)
|
||||||
|
conn.commit()
|
||||||
|
|
||||||
|
|
||||||
|
def get_stats():
|
||||||
|
"""Aggregate data for the dashboard."""
|
||||||
|
conn = get_connection()
|
||||||
|
|
||||||
|
recent_quizzes = conn.execute(
|
||||||
|
"SELECT * FROM quiz_sessions ORDER BY timestamp DESC LIMIT 10"
|
||||||
|
).fetchall()
|
||||||
|
|
||||||
|
total_reviews = conn.execute(
|
||||||
|
"SELECT COALESCE(SUM(reps), 0) FROM word_progress"
|
||||||
|
).fetchone()[0]
|
||||||
|
|
||||||
|
total_quizzes = conn.execute(
|
||||||
|
"SELECT COUNT(*) FROM quiz_sessions"
|
||||||
|
).fetchone()[0]
|
||||||
|
|
||||||
|
# Streak: count consecutive days with activity
|
||||||
|
days = conn.execute(
|
||||||
|
"SELECT DISTINCT DATE(last_review) as d FROM word_progress WHERE last_review IS NOT NULL ORDER BY d DESC"
|
||||||
|
).fetchall()
|
||||||
|
|
||||||
|
streak = 0
|
||||||
|
today = datetime.now(timezone.utc).date()
|
||||||
|
for i, row in enumerate(days):
|
||||||
|
day = datetime.fromisoformat(row["d"]).date() if isinstance(row["d"], str) else row["d"]
|
||||||
|
expected = today - __import__("datetime").timedelta(days=i)
|
||||||
|
if day == expected:
|
||||||
|
streak += 1
|
||||||
|
else:
|
||||||
|
break
|
||||||
|
|
||||||
|
return {
|
||||||
|
"recent_quizzes": [dict(r) for r in recent_quizzes],
|
||||||
|
"total_reviews": total_reviews,
|
||||||
|
"total_quizzes": total_quizzes,
|
||||||
|
"streak": streak,
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
|
def get_recent_essays(limit=10):
|
||||||
|
"""Return recent essays for the essay history view."""
|
||||||
|
conn = get_connection()
|
||||||
|
rows = conn.execute(
|
||||||
|
"SELECT * FROM essays ORDER BY timestamp DESC LIMIT ?", (limit,)
|
||||||
|
).fetchall()
|
||||||
|
return [dict(r) for r in rows]
|
||||||
|
|
||||||
|
|
||||||
|
def close():
|
||||||
|
"""Close the database connection."""
|
||||||
|
global _conn
|
||||||
|
if _conn:
|
||||||
|
_conn.close()
|
||||||
|
_conn = None
|
||||||
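The streak loop in `get_stats` is easy to get wrong around gaps in the activity history; factored into a pure function, the date arithmetic can be tested in isolation (a sketch — `compute_streak` is a hypothetical helper, not part of the module):

```python
from datetime import date, timedelta

def compute_streak(review_days, today):
    """Count consecutive days of activity ending today.

    review_days: distinct activity dates, newest first.
    """
    streak = 0
    for i, day in enumerate(review_days):
        # Day i back from today must be exactly today - i days, else the streak breaks
        if day == today - timedelta(days=i):
            streak += 1
        else:
            break
    return streak

today = date(2025, 3, 10)
days = [date(2025, 3, 10), date(2025, 3, 9), date(2025, 3, 7)]
print(compute_streak(days, today))  # 2: today and yesterday count, the gap breaks it
```

This mirrors the `today - timedelta(days=i)` comparison in the module, so a gap of even one day resets the count.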
0
python/persian-tutor/modules/__init__.py
Normal file
84
python/persian-tutor/modules/dashboard.py
Normal file
@@ -0,0 +1,84 @@
"""Dashboard: progress stats, charts, and overview."""

import db
from modules.vocab import load_vocab, get_categories


def get_overview():
    """Return overview stats: total words, seen, mastered, due today."""
    vocab = load_vocab()
    counts = db.get_word_counts(total_vocab_size=len(vocab))
    stats = db.get_stats()
    counts["streak"] = stats["streak"]
    counts["total_reviews"] = stats["total_reviews"]
    counts["total_quizzes"] = stats["total_quizzes"]
    return counts


def get_category_breakdown():
    """Return progress per category as list of dicts."""
    vocab = load_vocab()
    categories = get_categories()

    breakdown = []
    for cat in categories:
        cat_words = [e for e in vocab if e["category"] == cat]
        cat_ids = {e["id"] for e in cat_words}
        total = len(cat_words)

        seen = 0
        mastered = 0
        for wid in cat_ids:
            progress = db.get_word_progress(wid)
            if progress:
                seen += 1
                if progress["stability"] and progress["stability"] > 10:
                    mastered += 1

        breakdown.append({
            "Category": cat,
            "Total": total,
            "Seen": seen,
            "Mastered": mastered,
            "Progress": f"{seen}/{total}" if total > 0 else "0/0",
        })

    return breakdown


def get_recent_quizzes(limit=10):
    """Return recent quiz results as list of dicts for display."""
    stats = db.get_stats()
    quizzes = stats["recent_quizzes"][:limit]
    result = []
    for q in quizzes:
        result.append({
            "Date": q["timestamp"],
            "Category": q["category"] or "All",
            "Score": f"{q['correct']}/{q['total_questions']}",
            "Duration": f"{q['duration_seconds'] or 0}s",
        })
    return result


def format_overview_markdown():
    """Format overview stats as a markdown string for display."""
    o = get_overview()
    pct = (o["seen"] / o["total"] * 100) if o["total"] > 0 else 0
    bar_filled = int(pct / 5)
    bar_empty = 20 - bar_filled
    progress_bar = "█" * bar_filled + "░" * bar_empty

    lines = [
        "## Dashboard",
        "",
        f"**Words studied:** {o['seen']} / {o['total']} ({pct:.0f}%)",
        f"`{progress_bar}`",
        "",
        f"**Due today:** {o['due']}",
        f"**Mastered:** {o['mastered']}",
        f"**Daily streak:** {o['streak']} day{'s' if o['streak'] != 1 else ''}",
        f"**Total reviews:** {o['total_reviews']}",
        f"**Quiz sessions:** {o['total_quizzes']}",
    ]
    return "\n".join(lines)
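The 20-character text progress bar in `format_overview_markdown` truncates to whole cells (`int(pct / 5)`); isolated as a helper, the rounding behaviour is easy to verify (a sketch, not part of the module):

```python
def progress_bar(seen, total, width=20):
    """Render a text progress bar, truncating to whole cells like the dashboard does."""
    pct = (seen / total * 100) if total > 0 else 0
    filled = int(pct / (100 / width))  # width=20 gives 5% per cell
    return "█" * filled + "░" * (width - filled)

print(progress_bar(250, 1000))  # 25% -> 5 filled cells, 15 empty
```

Truncation means anything below 5% shows an empty bar, which is arguably the right signal for a learner just starting out.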
78
python/persian-tutor/modules/essay.py
Normal file
@@ -0,0 +1,78 @@
"""Essay writing and AI marking."""

import db
from ai import ask

MARKING_SYSTEM_PROMPT = """You are an expert Persian (Farsi) language teacher marking a GCSE-level essay.
You write in English but can read and correct Persian text.
Always provide constructive, encouraging feedback suitable for a language learner."""

MARKING_PROMPT_TEMPLATE = """Please mark this Persian essay written by a GCSE student.

Theme: {theme}

Student's essay:
{essay_text}

Please provide your response in this exact format:

**Grade:** [Give a grade from 1-9 matching GCSE grading, or a descriptive level like A2/B1]

**Summary:** [1-2 sentence overview of the essay quality]

**Corrections:**
[List specific errors with corrections. For each error, show the original text and the corrected version in Persian, with an English explanation]

**Improved version:**
[Rewrite the essay in corrected Persian]

**Tips for improvement:**
[3-5 specific, actionable tips for the student]"""


GCSE_THEMES = [
    "Identity and culture",
    "Local area and environment",
    "School and work",
    "Travel and tourism",
    "International and global dimension",
]


def mark_essay(essay_text, theme="General"):
    """Send essay to AI for marking. Returns structured feedback."""
    if not essay_text or not essay_text.strip():
        return "Please write an essay first."

    prompt = MARKING_PROMPT_TEMPLATE.format(
        theme=theme,
        essay_text=essay_text.strip(),
    )

    feedback = ask(prompt, system=MARKING_SYSTEM_PROMPT, quality="smart")

    # Extract grade from feedback (best-effort)
    grade = ""
    for line in feedback.split("\n"):
        if line.strip().startswith("**Grade:**"):
            grade = line.replace("**Grade:**", "").strip()
            break

    # Save to database
    db.save_essay(essay_text.strip(), grade, feedback, theme)

    return feedback


def get_essay_history(limit=10):
    """Return recent essays for the history view."""
    essays = db.get_recent_essays(limit)
    result = []
    for e in essays:
        result.append({
            "Date": e["timestamp"],
            "Theme": e["theme"] or "General",
            "Grade": e["grade"] or "-",
            "Preview": (e["essay_text"] or "")[:50] + "...",
        })
    return result
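The grade extraction in `mark_essay` depends entirely on the model echoing the `**Grade:**` line from the template; pulled out as a function, the parse (and its fallback to an empty string) looks like this (a standalone sketch of the same loop):

```python
def extract_grade(feedback):
    """Pull the grade out of a '**Grade:** ...' line in the AI's markdown reply.

    Returns "" when the model didn't follow the template, so callers
    should treat an empty grade as 'unparsed', not as a failure.
    """
    for line in feedback.split("\n"):
        if line.strip().startswith("**Grade:**"):
            return line.replace("**Grade:**", "").strip()
    return ""

sample = "**Grade:** 7\n\n**Summary:** Solid effort with a few agreement errors."
print(extract_grade(sample))  # 7
```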
200
python/persian-tutor/modules/idioms.py
Normal file
@@ -0,0 +1,200 @@
"""Persian idioms, expressions, and social conventions."""

import random

from ai import ask

# Built-in collection of common Persian expressions and idioms
EXPRESSIONS = [
    {
        "persian": "سلام علیکم",
        "finglish": "salâm aleykom",
        "english": "Peace be upon you (formal greeting)",
        "context": "Formal greeting, especially with elders",
    },
    {
        "persian": "خسته نباشید",
        "finglish": "khaste nabâshid",
        "english": "May you not be tired",
        "context": "Common greeting to someone who has been working. Used as 'hello' in shops, offices, etc.",
    },
    {
        "persian": "دستت درد نکنه",
        "finglish": "dastet dard nakone",
        "english": "May your hand not hurt",
        "context": "Thank you for your effort (after someone does something for you)",
    },
    {
        "persian": "قابلی نداره",
        "finglish": "ghâbeli nadâre",
        "english": "It's not worthy (of you)",
        "context": "You're welcome / Don't mention it — said when giving a gift or doing a favour",
    },
    {
        "persian": "تعارف نکن",
        "finglish": "ta'ârof nakon",
        "english": "Don't do ta'arof",
        "context": "Stop being politely modest — please accept! Part of Persian ta'arof culture.",
    },
    {
        "persian": "نوش جان",
        "finglish": "nush-e jân",
        "english": "May it nourish your soul",
        "context": "Said to someone eating — like 'bon appétit' or 'enjoy your meal'",
    },
    {
        "persian": "چشمت روز بد نبینه",
        "finglish": "cheshmet ruz-e bad nabine",
        "english": "May your eyes never see a bad day",
        "context": "A warm wish for someone's wellbeing",
    },
    {
        "persian": "قدمت روی چشم",
        "finglish": "ghadamet ru-ye cheshm",
        "english": "Your step is on my eye",
        "context": "Warm welcome — 'you're very welcome here'. Extremely hospitable expression.",
    },
    {
        "persian": "انشاءالله",
        "finglish": "inshâ'allâh",
        "english": "God willing",
        "context": "Used when talking about future plans. Very common in daily speech.",
    },
    {
        "persian": "ماشاءالله",
        "finglish": "mâshâ'allâh",
        "english": "What God has willed",
        "context": "Expression of admiration or praise, also used to ward off the evil eye.",
    },
    {
        "persian": "الهی شکر",
        "finglish": "elâhi shokr",
        "english": "Thank God",
        "context": "Expression of gratitude, similar to 'thankfully'",
    },
    {
        "persian": "به سلامتی",
        "finglish": "be salâmati",
        "english": "To your health / Cheers",
        "context": "A toast or general well-wishing expression",
    },
    {
        "persian": "عید مبارک",
        "finglish": "eyd mobârak",
        "english": "Happy holiday/celebration",
        "context": "Used for any celebration, especially Nowruz",
    },
    {
        "persian": "تسلیت میگم",
        "finglish": "tasliyat migam",
        "english": "I offer my condolences",
        "context": "Expressing sympathy when someone has lost a loved one",
    },
    {
        "persian": "خدا بیامرزه",
        "finglish": "khodâ biâmorze",
        "english": "May God forgive them (rest in peace)",
        "context": "Said about someone who has passed away",
    },
    {
        "persian": "زبونت رو گاز بگیر",
        "finglish": "zaboonet ro gâz begir",
        "english": "Bite your tongue",
        "context": "Don't say such things! (similar to English 'touch wood')",
    },
    {
        "persian": "دمت گرم",
        "finglish": "damet garm",
        "english": "May your breath be warm",
        "context": "Well done! / Good for you! (informal, friendly praise)",
    },
    {
        "persian": "چشم",
        "finglish": "cheshm",
        "english": "On my eye (I will do it)",
        "context": "Respectful way of saying 'yes, I'll do it' — shows obedience/respect",
    },
    {
        "persian": "بفرمایید",
        "finglish": "befarmâyid",
        "english": "Please (go ahead / help yourself / come in)",
        "context": "Very versatile polite expression: offering food, inviting someone in, or giving way",
    },
    {
        "persian": "ببخشید",
        "finglish": "bebakhshid",
        "english": "Excuse me / I'm sorry",
        "context": "Used for both apologies and getting someone's attention",
    },
    {
        "persian": "مخلصیم",
        "finglish": "mokhlesim",
        "english": "I'm your humble servant",
        "context": "Polite/humble way of saying goodbye or responding to a compliment (ta'arof)",
    },
    {
        "persian": "سرت سلامت باشه",
        "finglish": "saret salâmat bâshe",
        "english": "May your head be safe",
        "context": "Expression of condolence — 'I'm sorry for your loss'",
    },
    {
        "persian": "روی ما رو زمین ننداز",
        "finglish": "ru-ye mâ ro zamin nandâz",
        "english": "Don't throw our face on the ground",
        "context": "Please don't refuse/embarrass us — said when insisting on a request",
    },
    {
        "persian": "قربونت برم",
        "finglish": "ghorboonet beram",
        "english": "I'd sacrifice myself for you",
        "context": "Term of endearment — very common among family and close friends",
    },
    {
        "persian": "جون دل",
        "finglish": "jun-e del",
        "english": "Life of my heart",
        "context": "Affectionate term used with loved ones",
    },
]


def get_all_expressions():
    """Return all built-in expressions."""
    return EXPRESSIONS


def get_random_expression():
    """Pick a random expression."""
    return random.choice(EXPRESSIONS)


def explain_expression(expression):
    """Use AI to generate a detailed explanation with usage examples."""
    prompt = f"""Explain this Persian expression for an English-speaking student:

Persian: {expression['persian']}
Transliteration: {expression['finglish']}
Literal meaning: {expression['english']}
Context: {expression['context']}

Please provide:
1. A fuller explanation of when and how this is used
2. The cultural context (ta'arof, hospitality, etc.)
3. Two example dialogues showing it in use (in Persian with English translation)
4. Any variations or related expressions

Keep it concise and student-friendly."""

    return ask(prompt, quality="fast")


def format_expression(expr, show_transliteration="off"):
    """Format an expression for display."""
    parts = [
        f'<div dir="rtl" style="font-size:1.8em; text-align:center">{expr["persian"]}</div>',
        f'<div style="text-align:center; font-size:1.2em">{expr["english"]}</div>',
    ]
    if show_transliteration != "off":
        parts.append(f'<div style="text-align:center; color:#666; font-style:italic">{expr["finglish"]}</div>')
    parts.append(f'<div style="text-align:center; color:#888; margin-top:0.5em">{expr["context"]}</div>')
    return "\n".join(parts)
65
python/persian-tutor/modules/tutor.py
Normal file
@@ -0,0 +1,65 @@
"""Conversational Persian lessons by GCSE theme."""

import time

import db
from ai import chat_ollama

TUTOR_SYSTEM_PROMPT = """You are a friendly Persian (Farsi) language tutor teaching English-speaking GCSE students.

Rules:
- Use a mix of English and Persian. Start mostly in English, gradually introducing more Persian.
- When you write Persian, also provide the Finglish transliteration in parentheses.
- Keep responses concise (2-4 sentences per turn).
- Ask the student to practice: translate phrases, answer questions in Persian, or fill in blanks.
- Correct mistakes gently and explain why.
- Stay on the current theme/topic.
- Use Iranian Persian (Farsi), not Dari or Tajik.
- Adapt to the student's level based on their responses."""

THEME_PROMPTS = {
    "Identity and culture": "Let's practice talking about family, personality, daily routines, and Persian celebrations like Nowruz!",
    "Local area and environment": "Let's practice talking about your home, neighbourhood, shopping, and the environment!",
    "School and work": "Let's practice talking about school subjects, school life, jobs, and future plans!",
    "Travel and tourism": "Let's practice talking about transport, directions, holidays, hotels, and restaurants!",
    "International and global dimension": "Let's practice talking about health, global issues, technology, and social media!",
    "Free conversation": "Let's have a free conversation in Persian! I'll help you along the way.",
}


def start_lesson(theme):
    """Generate the opening message for a new lesson.

    Returns:
        (assistant_message, messages_list, system_prompt)
    """
    intro = THEME_PROMPTS.get(theme, THEME_PROMPTS["Free conversation"])
    system = TUTOR_SYSTEM_PROMPT + f"\n\nCurrent topic: {theme}. {intro}"

    messages = [{"role": "user", "content": f"I'd like to practice Persian. Today's theme is: {theme}"}]
    response = chat_ollama(messages, system=system)
    messages.append({"role": "assistant", "content": response})

    return response, messages, system


def process_response(user_input, messages, system=None):
    """Add user input to conversation, get AI response.

    Returns:
        (assistant_response, updated_messages)
    """
    if not user_input or not user_input.strip():
        return "", messages

    messages.append({"role": "user", "content": user_input.strip()})
    response = chat_ollama(messages, system=system)
    messages.append({"role": "assistant", "content": response})

    return response, messages


def save_session(theme, messages, start_time):
    """Save the current tutor session to the database."""
    duration = int(time.time() - start_time)
    db.save_tutor_session(theme, messages, duration)
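The turn protocol in `process_response` — append the user message, call the model, append the reply — is backend-agnostic; here is the same shape with an injected chat function in place of `chat_ollama` (a standalone sketch using a stub, not the module's code):

```python
def take_turn(user_input, messages, chat_fn):
    """One conversation turn: record user input, get and record the reply.

    Blank input is a no-op so accidental submits don't pollute the history.
    """
    if not user_input or not user_input.strip():
        return "", messages
    messages.append({"role": "user", "content": user_input.strip()})
    reply = chat_fn(messages)
    messages.append({"role": "assistant", "content": reply})
    return reply, messages

def echo_bot(messages):
    # Stub backend: echoes the most recent user message
    return f"You said: {messages[-1]['content']}"

reply, history = take_turn("salâm!", [], echo_bot)
print(reply)         # You said: salâm!
print(len(history))  # 2
```

Because the full `messages` list is passed on every call, the backend sees the whole conversation each turn, which is what lets the tutor adapt to the student's level.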
152
python/persian-tutor/modules/vocab.py
Normal file
@@ -0,0 +1,152 @@
"""Vocabulary search, flashcard logic, and FSRS-driven review."""

import json
import random
from pathlib import Path

import fsrs

import db


VOCAB_PATH = Path(__file__).parent.parent / "data" / "vocabulary.json"

_vocab_data = None


def load_vocab():
    """Load vocabulary data from JSON (cached)."""
    global _vocab_data
    if _vocab_data is None:
        with open(VOCAB_PATH, encoding="utf-8") as f:
            _vocab_data = json.load(f)
    return _vocab_data


def get_categories():
    """Return sorted list of unique categories."""
    vocab = load_vocab()
    return sorted({entry["category"] for entry in vocab})


def get_sections():
    """Return sorted list of unique sections."""
    vocab = load_vocab()
    return sorted({entry["section"] for entry in vocab})


def search(query, vocab_data=None):
    """Search vocabulary by English or Persian text. Returns matching entries."""
    if not query or not query.strip():
        return []
    vocab = vocab_data or load_vocab()
    query_lower = query.strip().lower()
    results = []
    for entry in vocab:
        if (
            query_lower in entry["english"].lower()
            or query_lower in entry["persian"]
            or (entry.get("finglish") and query_lower in entry["finglish"].lower())
        ):
            results.append(entry)
    return results


def get_random_word(vocab_data=None, category=None):
    """Pick a random vocabulary entry, optionally filtered by category."""
    vocab = vocab_data or load_vocab()
    if category and category != "All":
        filtered = [e for e in vocab if e["category"] == category]
    else:
        filtered = vocab
    if not filtered:
        return None
    return random.choice(filtered)


def get_flashcard_batch(count=10, category=None):
    """Get a batch of words for flashcard study.

    Prioritizes due words (FSRS), then fills with new/random words.
    """
    vocab = load_vocab()

    if category and category != "All":
        pool = [e for e in vocab if e["category"] == category]
    else:
        pool = vocab

    # Get due words first
    due_ids = db.get_due_words(limit=count)
    due_entries = [e for e in pool if e["id"] in due_ids]

    # Fill remaining with unseen or random words
    remaining = count - len(due_entries)
    if remaining > 0:
        seen_ids = {e["id"] for e in due_entries}
        # Prefer unseen words
        unseen = [e for e in pool if e["id"] not in seen_ids and not db.get_word_progress(e["id"])]
        if len(unseen) >= remaining:
            fill = random.sample(unseen, remaining)
        else:
            # Use all unseen + random from rest
            fill = unseen
            still_needed = remaining - len(fill)
            rest = [e for e in pool if e["id"] not in seen_ids and e not in fill]
            if rest:
                fill.extend(random.sample(rest, min(still_needed, len(rest))))
        due_entries.extend(fill)

    random.shuffle(due_entries)
    return due_entries


def check_answer(word_id, user_answer, direction="en_to_fa"):
    """Check if user's answer matches the target word.

    Args:
        word_id: Vocabulary entry ID.
        user_answer: What the user typed.
        direction: "en_to_fa" (user writes Persian) or "fa_to_en" (user writes English).

    Returns:
        (is_correct, correct_answer, entry)
    """
    vocab = load_vocab()
    entry = next((e for e in vocab if e["id"] == word_id), None)
    if not entry:
        return False, "", None

    user_answer = user_answer.strip()

    if direction == "en_to_fa":
        correct = entry["persian"].strip()
        is_correct = user_answer == correct
    else:
        correct = entry["english"].strip().lower()
        is_correct = user_answer.lower() == correct

    return is_correct, correct if not is_correct else user_answer, entry


def format_word_card(entry, show_transliteration="off"):
    """Format a vocabulary entry for display as RTL-safe markdown."""
    parts = []
    parts.append(f'<div dir="rtl" style="font-size:2em; text-align:center">{entry["persian"]}</div>')
    parts.append(f'<div style="font-size:1.3em; text-align:center">{entry["english"]}</div>')

    if show_transliteration != "off" and entry.get("finglish"):
        parts.append(f'<div style="text-align:center; color:#666; font-style:italic">{entry["finglish"]}</div>')

    parts.append(f'<div style="text-align:center; color:#999; font-size:0.9em">{entry.get("category", "")}</div>')
    return "\n".join(parts)


def get_word_status(word_id):
    """Return status string for a word: new, learning, or mastered."""
    progress = db.get_word_progress(word_id)
    if not progress:
        return "new"
    if progress["stability"] and progress["stability"] > 10:
        return "mastered"
    return "learning"
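`check_answer` compares exactly in the English-to-Persian direction (Persian has no case) but case-insensitively when the student types English; the core comparison, isolated with an inline entry instead of the JSON data (a standalone sketch):

```python
def is_correct(entry, user_answer, direction="en_to_fa"):
    """Exact match for Persian answers, case-insensitive for English ones."""
    user_answer = user_answer.strip()
    if direction == "en_to_fa":
        # Student typed Persian: compare exactly (after trimming whitespace)
        return user_answer == entry["persian"].strip()
    # Student typed English: ignore case
    return user_answer.lower() == entry["english"].strip().lower()

entry = {"persian": "کتاب", "english": "book"}
print(is_correct(entry, "کتاب"))              # True
print(is_correct(entry, "Book", "fa_to_en"))  # True
```

One limitation worth noting: exact Persian matching treats variant spellings (e.g. ی vs ي, or a stray ZWNJ) as wrong, so answers copied from other keyboards may need normalisation.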
3
python/persian-tutor/requirements.txt
Normal file
@@ -0,0 +1,3 @@
gradio>=4.0
genanki
fsrs
1100
python/persian-tutor/scripts/build_vocab.py
Normal file
File diff suppressed because it is too large
81
python/persian-tutor/scripts/generate_vocab.py
Normal file
@@ -0,0 +1,81 @@
|
|||||||
|
#!/usr/bin/env python3
|
||||||
|
"""One-time script to generate/update vocabulary.json with AI-assisted transliterations.
|
||||||
|
|
||||||
|
Usage:
|
    python scripts/generate_vocab.py

This reads an existing vocabulary.json, finds entries missing finglish
transliterations, and uses Ollama to generate them.
"""

import json
import sys
from pathlib import Path

sys.path.insert(0, str(Path(__file__).parent.parent))
from ai import ask_ollama

VOCAB_PATH = Path(__file__).parent.parent / "data" / "vocabulary.json"


def generate_transliterations(vocab):
    """Fill in missing finglish transliterations using AI."""
    missing = [e for e in vocab if not e.get("finglish")]
    if not missing:
        print("All entries already have finglish transliterations.")
        return vocab

    print(f"Generating transliterations for {len(missing)} entries...")

    # Process in batches of 20
    batch_size = 20
    for i in range(0, len(missing), batch_size):
        batch = missing[i : i + batch_size]
        pairs = "\n".join(f"{e['persian']} = {e['english']}" for e in batch)

        prompt = f"""For each Persian word below, provide the Finglish (romanized) transliteration.
Use these conventions: â for آ, kh for خ, sh for ش, zh for ژ, gh for ق/غ, ch for چ.
Reply with ONLY the transliterations, one per line, in the same order.

{pairs}"""

        try:
            response = ask_ollama(prompt, model="qwen2.5:7b")
            lines = [l.strip() for l in response.strip().split("\n") if l.strip()]

            for j, entry in enumerate(batch):
                if j < len(lines):
                    # Clean up the response line: remove any numbering or equals signs
                    line = lines[j]
                    for sep in ["=", ":", "-", "."]:
                        if sep in line:
                            line = line.split(sep)[-1].strip()
                    entry["finglish"] = line

            print(f"  Processed {min(i + batch_size, len(missing))}/{len(missing)}")
        except Exception as e:
            print(f"  Error processing batch: {e}")

    return vocab


def main():
    if not VOCAB_PATH.exists():
        print(f"No vocabulary file found at {VOCAB_PATH}")
        return

    with open(VOCAB_PATH, encoding="utf-8") as f:
        vocab = json.load(f)

    print(f"Loaded {len(vocab)} entries")
    vocab = generate_transliterations(vocab)

    with open(VOCAB_PATH, "w", encoding="utf-8") as f:
        json.dump(vocab, f, ensure_ascii=False, indent=2)

    print(f"Saved {len(vocab)} entries to {VOCAB_PATH}")


if __name__ == "__main__":
    main()
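The separator-stripping loop in `generate_transliterations` can be checked in isolation. A minimal sketch (the `clean_line` helper name is mine, not the script's) of how a numbered or echoed reply line gets cleaned:

```python
def clean_line(line: str) -> str:
    """Mirror the script's cleanup: keep the text after the last separator found."""
    for sep in ["=", ":", "-", "."]:
        if sep in line:
            line = line.split(sep)[-1].strip()
    return line

print(clean_line("1. raftan"))        # numbered list item from the model
print(clean_line("رفتن = raftan"))    # model echoed the Persian = English pair
```

One trade-off worth noting: because "-" is in the separator list, a legitimately hyphenated transliteration would be truncated to its final segment.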
65 python/persian-tutor/stt.py Normal file
@@ -0,0 +1,65 @@
"""Persian speech-to-text wrapper using sttlib."""

import sys

import numpy as np

sys.path.insert(0, "/home/ys/family-repo/Code/python/tool-speechtotext")
from sttlib import load_whisper_model, transcribe, is_hallucination

_model = None

# Common Whisper hallucinations on Persian audio or silence
PERSIAN_HALLUCINATIONS = [
    "ممنون",  # "thank you" hallucination
    "خداحافظ",  # "goodbye" hallucination
    "تماشا کنید",  # "watch" hallucination
    "لایک کنید",  # "like" hallucination
]


def get_model(size="medium"):
    """Load Whisper model (cached singleton; size applies only on first call)."""
    global _model
    if _model is None:
        _model = load_whisper_model(size)
    return _model


def transcribe_persian(audio_tuple):
    """Transcribe Persian audio from a Gradio audio component.

    Args:
        audio_tuple: (sample_rate, numpy_array) from gr.Audio component.

    Returns:
        Transcribed text string, or empty string on failure/hallucination.
    """
    if audio_tuple is None:
        return ""

    sr, audio = audio_tuple
    model = get_model()

    # Convert to float32 normalized to [-1, 1]
    if audio.dtype == np.int16:
        audio_float = audio.astype(np.float32) / 32768.0
    elif np.issubdtype(audio.dtype, np.floating):
        # Covers float32 and float64 (np.iinfo would raise on float dtypes)
        audio_float = audio.astype(np.float32)
    else:
        audio_float = audio.astype(np.float32) / np.iinfo(audio.dtype).max

    # Downmix to mono if stereo
    if audio_float.ndim > 1:
        audio_float = audio_float.mean(axis=1)

    # Use sttlib transcribe
    text = transcribe(model, audio_float)

    # Filter hallucinations (English via sttlib, plus the Persian list above)
    if is_hallucination(text):
        return ""
    if text.strip() in PERSIAN_HALLUCINATIONS:
        return ""

    return text
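The dtype-normalization step above is pure numpy and can be exercised without a Whisper model. A standalone sketch (function name is mine) of the same conversion, including the float-dtype case:

```python
import numpy as np

def to_float32_mono(audio: np.ndarray) -> np.ndarray:
    """Normalize PCM samples to float32 in [-1, 1] and downmix stereo to mono."""
    if audio.dtype == np.int16:
        out = audio.astype(np.float32) / 32768.0
    elif np.issubdtype(audio.dtype, np.floating):
        out = audio.astype(np.float32)
    else:
        out = audio.astype(np.float32) / np.iinfo(audio.dtype).max
    if out.ndim > 1:
        out = out.mean(axis=1)
    return out

# Two stereo frames of int16 PCM
stereo = np.array([[32767, -32768], [0, 16384]], dtype=np.int16)
mono = to_float32_mono(stereo)
print(mono.dtype, mono.shape)
```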
0 python/persian-tutor/tests/__init__.py Normal file
89 python/persian-tutor/tests/test_ai.py Normal file
@@ -0,0 +1,89 @@
"""Tests for ai.py — dual AI backend."""

import sys
from pathlib import Path
from unittest.mock import patch, MagicMock

import pytest

sys.path.insert(0, str(Path(__file__).parent.parent))

import ai


def test_ask_ollama_calls_ollama_chat():
    """ask_ollama should call ollama.chat with correct messages."""
    mock_response = MagicMock()
    mock_response.message.content = "test response"

    with patch("ai.ollama.chat", return_value=mock_response) as mock_chat:
        result = ai.ask_ollama("Hello", system="Be helpful")
        assert result == "test response"

        call_args = mock_chat.call_args
        messages = call_args.kwargs.get("messages") or call_args[1].get("messages")
        assert len(messages) == 2
        assert messages[0]["role"] == "system"
        assert messages[1]["role"] == "user"
        assert messages[1]["content"] == "Hello"


def test_ask_ollama_no_system():
    """ask_ollama without a system prompt should only send the user message."""
    mock_response = MagicMock()
    mock_response.message.content = "response"

    with patch("ai.ollama.chat", return_value=mock_response) as mock_chat:
        ai.ask_ollama("Hi")
        call_args = mock_chat.call_args
        messages = call_args.kwargs.get("messages") or call_args[1].get("messages")
        assert len(messages) == 1
        assert messages[0]["role"] == "user"


def test_ask_claude_calls_subprocess():
    """ask_claude should call the claude CLI via subprocess."""
    with patch("ai.subprocess.run") as mock_run:
        mock_run.return_value = MagicMock(stdout="Claude says hi\n")
        result = ai.ask_claude("Hello")
        assert result == "Claude says hi"
        mock_run.assert_called_once()
        args = mock_run.call_args[0][0]
        assert args[0] == "claude"
        assert "-p" in args


def test_ask_fast_uses_ollama():
    """ask with quality='fast' should use Ollama."""
    with patch("ai.ask_ollama", return_value="ollama response") as mock:
        result = ai.ask("test", quality="fast")
        assert result == "ollama response"
        mock.assert_called_once()


def test_ask_smart_uses_claude():
    """ask with quality='smart' should use Claude."""
    with patch("ai.ask_claude", return_value="claude response") as mock:
        result = ai.ask("test", quality="smart")
        assert result == "claude response"
        mock.assert_called_once()


def test_chat_ollama():
    """chat_ollama should pass multi-turn messages."""
    mock_response = MagicMock()
    mock_response.message.content = "continuation"

    with patch("ai.ollama.chat", return_value=mock_response) as mock_chat:
        messages = [
            {"role": "user", "content": "Hi"},
            {"role": "assistant", "content": "Hello!"},
            {"role": "user", "content": "How are you?"},
        ]
        result = ai.chat_ollama(messages, system="Be helpful")
        assert result == "continuation"

        call_args = mock_chat.call_args
        all_msgs = call_args.kwargs.get("messages") or call_args[1].get("messages")
        # system + 3 conversation messages
        assert len(all_msgs) == 4
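The tests above pin down the CLI contract of `ask_claude`: argv starts with `claude` and the prompt is passed via `-p`. A hedged sketch of that shape (function name and exact flag handling are inferred from the tests, not from ai.py itself; `echo` stands in for the real CLI so the plumbing runs anywhere):

```python
import subprocess

def ask_cli(prompt: str, cmd: str = "claude") -> str:
    """Inferred shape of ai.ask_claude: one-shot prompt via `-p`, stdout stripped."""
    result = subprocess.run([cmd, "-p", prompt], capture_output=True, text=True)
    return result.stdout.strip()

# With `echo` substituted, the argv is simply printed back
print(ask_cli("Hello", cmd="echo"))
```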
86 python/persian-tutor/tests/test_anki_export.py Normal file
@@ -0,0 +1,86 @@
"""Tests for anki_export.py — Anki .apkg generation."""

import os
import sys
import zipfile
from pathlib import Path

import pytest

sys.path.insert(0, str(Path(__file__).parent.parent))

from anki_export import export_deck

SAMPLE_VOCAB = [
    {
        "id": "verb_go",
        "section": "High-frequency language",
        "category": "Common verbs",
        "english": "to go",
        "persian": "رفتن",
        "finglish": "raftan",
    },
    {
        "id": "verb_eat",
        "section": "High-frequency language",
        "category": "Common verbs",
        "english": "to eat",
        "persian": "خوردن",
        "finglish": "khordan",
    },
    {
        "id": "colour_red",
        "section": "High-frequency language",
        "category": "Colours",
        "english": "red",
        "persian": "قرمز",
        "finglish": "ghermez",
    },
]


def test_export_deck_creates_file(tmp_path):
    """export_deck should create a valid .apkg file."""
    output = str(tmp_path / "test.apkg")
    result = export_deck(SAMPLE_VOCAB, output_path=output)
    assert result == output
    assert os.path.exists(output)
    assert os.path.getsize(output) > 0


def test_export_deck_is_valid_zip(tmp_path):
    """An .apkg file is a zip archive containing an Anki SQLite database."""
    output = str(tmp_path / "test.apkg")
    export_deck(SAMPLE_VOCAB, output_path=output)
    assert zipfile.is_zipfile(output)


def test_export_deck_with_category_filter(tmp_path):
    """export_deck with a category filter should only include matching entries."""
    output = str(tmp_path / "test.apkg")
    export_deck(SAMPLE_VOCAB, categories=["Colours"], output_path=output)
    # File should exist and be smaller than unfiltered
    assert os.path.exists(output)
    size_filtered = os.path.getsize(output)

    output2 = str(tmp_path / "test_all.apkg")
    export_deck(SAMPLE_VOCAB, output_path=output2)
    size_all = os.path.getsize(output2)

    # Filtered deck should be smaller (fewer cards)
    assert size_filtered <= size_all


def test_export_deck_empty_vocab(tmp_path):
    """export_deck with empty vocabulary should still create a valid file."""
    output = str(tmp_path / "test.apkg")
    export_deck([], output_path=output)
    assert os.path.exists(output)


def test_export_deck_no_category_match(tmp_path):
    """export_deck with a non-matching category filter should create an empty deck."""
    output = str(tmp_path / "test.apkg")
    export_deck(SAMPLE_VOCAB, categories=["Nonexistent"], output_path=output)
    assert os.path.exists(output)
151 python/persian-tutor/tests/test_db.py Normal file
@@ -0,0 +1,151 @@
"""Tests for db.py — SQLite database layer with FSRS integration."""

import sys
from pathlib import Path

import pytest

# Add project root to path
sys.path.insert(0, str(Path(__file__).parent.parent))

import fsrs


@pytest.fixture(autouse=True)
def temp_db(tmp_path):
    """Use a temporary database for each test."""
    import db as db_mod

    db_mod._conn = None
    db_mod.DB_PATH = tmp_path / "test.db"
    db_mod.init_db()
    yield db_mod
    db_mod.close()


def test_init_db_creates_tables(temp_db):
    """init_db should create all required tables."""
    conn = temp_db.get_connection()
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type='table'"
    ).fetchall()
    table_names = {row["name"] for row in tables}
    assert "word_progress" in table_names
    assert "quiz_sessions" in table_names
    assert "essays" in table_names
    assert "tutor_sessions" in table_names


def test_get_word_progress_nonexistent(temp_db):
    """Should return None for a word that hasn't been reviewed."""
    assert temp_db.get_word_progress("nonexistent") is None


def test_update_and_get_word_progress(temp_db):
    """update_word_progress should create and update progress."""
    card = temp_db.update_word_progress("verb_go", fsrs.Rating.Good)
    assert card is not None
    assert card.stability is not None

    progress = temp_db.get_word_progress("verb_go")
    assert progress is not None
    assert progress["word_id"] == "verb_go"
    assert progress["reps"] == 1
    assert progress["fsrs_state"] is not None


def test_update_word_progress_increments_reps(temp_db):
    """Reviewing the same word multiple times should increment reps."""
    temp_db.update_word_progress("verb_go", fsrs.Rating.Good)
    temp_db.update_word_progress("verb_go", fsrs.Rating.Easy)
    progress = temp_db.get_word_progress("verb_go")
    assert progress["reps"] == 2


def test_get_due_words(temp_db):
    """get_due_words should return words that are due for review."""
    # A newly reviewed word with Rating.Again should be due soon
    temp_db.update_word_progress("verb_go", fsrs.Rating.Again)
    # An easy word should have a later due date
    temp_db.update_word_progress("verb_eat", fsrs.Rating.Easy)

    # Due words depend on timing; at minimum both should be in the system
    all_progress = temp_db.get_connection().execute(
        "SELECT word_id FROM word_progress"
    ).fetchall()
    assert len(all_progress) == 2


def test_get_word_counts(temp_db):
    """get_word_counts should return correct counts."""
    counts = temp_db.get_word_counts(total_vocab_size=100)
    assert counts["total"] == 100
    assert counts["seen"] == 0
    assert counts["mastered"] == 0
    assert counts["due"] == 0

    temp_db.update_word_progress("verb_go", fsrs.Rating.Good)
    counts = temp_db.get_word_counts(total_vocab_size=100)
    assert counts["seen"] == 1


def test_record_quiz_session(temp_db):
    """record_quiz_session should insert a quiz record."""
    temp_db.record_quiz_session("Common verbs", 10, 7, 120)
    rows = temp_db.get_connection().execute(
        "SELECT * FROM quiz_sessions"
    ).fetchall()
    assert len(rows) == 1
    assert rows[0]["correct"] == 7
    assert rows[0]["total_questions"] == 10


def test_save_essay(temp_db):
    """save_essay should store the essay and feedback."""
    temp_db.save_essay("متن آزمایشی", "B1", "Good effort!", "Identity and culture")
    essays = temp_db.get_recent_essays()
    assert len(essays) == 1
    assert essays[0]["grade"] == "B1"


def test_save_tutor_session(temp_db):
    """save_tutor_session should store the conversation."""
    messages = [
        {"role": "user", "content": "سلام"},
        {"role": "assistant", "content": "سلام! حالت چطوره؟"},
    ]
    temp_db.save_tutor_session("Identity and culture", messages, 300)
    rows = temp_db.get_connection().execute(
        "SELECT * FROM tutor_sessions"
    ).fetchall()
    assert len(rows) == 1
    assert rows[0]["theme"] == "Identity and culture"


def test_get_stats(temp_db):
    """get_stats should return aggregated stats."""
    stats = temp_db.get_stats()
    assert stats["total_reviews"] == 0
    assert stats["total_quizzes"] == 0
    assert stats["streak"] == 0
    assert isinstance(stats["recent_quizzes"], list)


def test_close_and_reopen(temp_db):
    """Closing and reopening should preserve data."""
    temp_db.update_word_progress("verb_go", fsrs.Rating.Good)
    db_path = temp_db.DB_PATH

    temp_db.close()

    # Reopen
    temp_db._conn = None
    temp_db.DB_PATH = db_path
    temp_db.init_db()

    progress = temp_db.get_word_progress("verb_go")
    assert progress is not None
    assert progress["reps"] == 1
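The storage pattern these tests rely on, FSRS card state serialized as JSON in a SQLite TEXT column, can be sketched with the stdlib alone. Column names follow the tests; the card fields shown are hypothetical stand-ins for whatever py-fsrs serializes:

```python
import json
import sqlite3

# In-memory stand-in for data/progress.db
conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row
conn.execute(
    "CREATE TABLE word_progress (word_id TEXT PRIMARY KEY, reps INTEGER, fsrs_state TEXT)"
)

state = {"stability": 2.3, "difficulty": 5.0}  # hypothetical FSRS card fields
conn.execute(
    "INSERT INTO word_progress VALUES (?, ?, ?)",
    ("verb_go", 1, json.dumps(state)),
)

row = conn.execute(
    "SELECT * FROM word_progress WHERE word_id = ?", ("verb_go",)
).fetchone()
print(row["reps"], json.loads(row["fsrs_state"])["stability"])
```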
204 python/persian-tutor/tests/test_vocab.py Normal file
@@ -0,0 +1,204 @@
"""Tests for modules/vocab.py — vocabulary search and flashcard logic."""

import sys
from pathlib import Path

import pytest

sys.path.insert(0, str(Path(__file__).parent.parent))

SAMPLE_VOCAB = [
    {
        "id": "verb_go",
        "section": "High-frequency language",
        "category": "Common verbs",
        "english": "to go",
        "persian": "رفتن",
        "finglish": "raftan",
    },
    {
        "id": "verb_eat",
        "section": "High-frequency language",
        "category": "Common verbs",
        "english": "to eat",
        "persian": "خوردن",
        "finglish": "khordan",
    },
    {
        "id": "adj_big",
        "section": "High-frequency language",
        "category": "Common adjectives",
        "english": "big",
        "persian": "بزرگ",
        "finglish": "bozorg",
    },
    {
        "id": "colour_red",
        "section": "High-frequency language",
        "category": "Colours",
        "english": "red",
        "persian": "قرمز",
        "finglish": "ghermez",
    },
]


@pytest.fixture(autouse=True)
def mock_vocab_and_db(tmp_path):
    """Mock vocabulary loading and use a temp DB."""
    import db as db_mod
    import modules.vocab as vocab_mod

    # Temp DB
    db_mod._conn = None
    db_mod.DB_PATH = tmp_path / "test.db"
    db_mod.init_db()

    # Mock vocab
    vocab_mod._vocab_data = SAMPLE_VOCAB

    yield vocab_mod

    db_mod.close()
    vocab_mod._vocab_data = None


def test_load_vocab(mock_vocab_and_db):
    """load_vocab should return the vocabulary data."""
    data = mock_vocab_and_db.load_vocab()
    assert len(data) == 4


def test_get_categories(mock_vocab_and_db):
    """get_categories should return unique sorted categories."""
    cats = mock_vocab_and_db.get_categories()
    assert "Colours" in cats
    assert "Common verbs" in cats
    assert "Common adjectives" in cats


def test_search_english(mock_vocab_and_db):
    """Search should find entries by English text."""
    results = mock_vocab_and_db.search("go")
    assert len(results) == 1
    assert results[0]["id"] == "verb_go"


def test_search_persian(mock_vocab_and_db):
    """Search should find entries by Persian text."""
    results = mock_vocab_and_db.search("رفتن")
    assert len(results) == 1
    assert results[0]["id"] == "verb_go"


def test_search_finglish(mock_vocab_and_db):
    """Search should find entries by Finglish text."""
    results = mock_vocab_and_db.search("raftan")
    assert len(results) == 1
    assert results[0]["id"] == "verb_go"


def test_search_empty(mock_vocab_and_db):
    """Empty search should return an empty list."""
    assert mock_vocab_and_db.search("") == []
    assert mock_vocab_and_db.search(None) == []


def test_search_no_match(mock_vocab_and_db):
    """Search with no match should return an empty list."""
    assert mock_vocab_and_db.search("zzzzz") == []


def test_get_random_word(mock_vocab_and_db):
    """get_random_word should return a valid entry."""
    word = mock_vocab_and_db.get_random_word()
    assert word is not None
    assert "id" in word
    assert "english" in word
    assert "persian" in word


def test_get_random_word_with_category(mock_vocab_and_db):
    """get_random_word with a category filter should only return matching entries."""
    word = mock_vocab_and_db.get_random_word(category="Colours")
    assert word is not None
    assert word["category"] == "Colours"


def test_get_random_word_nonexistent_category(mock_vocab_and_db):
    """get_random_word with a bad category should return None."""
    word = mock_vocab_and_db.get_random_word(category="Nonexistent")
    assert word is None


def test_check_answer_correct_en_to_fa(mock_vocab_and_db):
    """Correct Persian answer should be marked correct."""
    correct, answer, entry = mock_vocab_and_db.check_answer(
        "verb_go", "رفتن", direction="en_to_fa"
    )
    assert correct is True


def test_check_answer_incorrect_en_to_fa(mock_vocab_and_db):
    """Incorrect Persian answer should be marked incorrect, with the correct answer."""
    correct, answer, entry = mock_vocab_and_db.check_answer(
        "verb_go", "خوردن", direction="en_to_fa"
    )
    assert correct is False
    assert answer == "رفتن"


def test_check_answer_fa_to_en(mock_vocab_and_db):
    """Correct English answer (case-insensitive) should be marked correct."""
    correct, answer, entry = mock_vocab_and_db.check_answer(
        "verb_go", "To Go", direction="fa_to_en"
    )
    assert correct is True


def test_check_answer_nonexistent_word(mock_vocab_and_db):
    """Checking an answer for a nonexistent word should return False."""
    correct, answer, entry = mock_vocab_and_db.check_answer(
        "nonexistent", "test", direction="en_to_fa"
    )
    assert correct is False
    assert entry is None


def test_format_word_card(mock_vocab_and_db):
    """format_word_card should produce RTL HTML with correct content."""
    entry = SAMPLE_VOCAB[0]
    html = mock_vocab_and_db.format_word_card(entry, show_transliteration="Finglish")
    assert "رفتن" in html
    assert "to go" in html
    assert "raftan" in html


def test_format_word_card_no_transliteration(mock_vocab_and_db):
    """format_word_card with transliteration off should not show finglish."""
    entry = SAMPLE_VOCAB[0]
    html = mock_vocab_and_db.format_word_card(entry, show_transliteration="off")
    assert "raftan" not in html


def test_get_flashcard_batch(mock_vocab_and_db):
    """get_flashcard_batch should return a batch of entries."""
    batch = mock_vocab_and_db.get_flashcard_batch(count=2)
    assert len(batch) == 2
    assert all("id" in e for e in batch)


def test_get_word_status_new(mock_vocab_and_db):
    """An unreviewed word should have status 'new'."""
    assert mock_vocab_and_db.get_word_status("verb_go") == "new"


def test_get_word_status_learning(mock_vocab_and_db):
    """A recently reviewed word should have status 'learning'."""
    import db
    import fsrs

    db.update_word_progress("verb_go", fsrs.Rating.Good)
    assert mock_vocab_and_db.get_word_status("verb_go") == "learning"
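test_check_answer_fa_to_en implies case-insensitive matching for English answers ("To Go" passes for "to go"). A minimal sketch of the comparison that behavior suggests (helper name is mine; the real check_answer may also tolerate other variations):

```python
def check_en_answer(given: str, expected: str) -> bool:
    """Case-insensitive, whitespace-tolerant comparison for fa_to_en answers."""
    return given.strip().lower() == expected.strip().lower()

print(check_en_answer("To Go", "to go"))   # matches despite capitalization
print(check_en_answer("to eat", "to go"))  # different word, no match
```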
@@ -1,5 +1,11 @@
 {
     "python-envs.defaultEnvManager": "ms-python.python:conda",
     "python-envs.defaultPackageManager": "ms-python.python:conda",
-    "python-envs.pythonProjects": []
+    "python-envs.pythonProjects": [],
+    "python.testing.pytestEnabled": true,
+    "python.testing.unittestEnabled": false,
+    "python.testing.pytestArgs": [
+        "tests",
+        "-v"
+    ]
 }
@@ -12,11 +12,20 @@ Speech-to-text command line utilities leveraging local models (faster-whisper, O
 ## Tools
 - `assistant.py` / `talk.sh` — transcribe speech, copy to clipboard, optionally send to Ollama
 - `voice_to_terminal.py` / `terminal.sh` — voice-controlled terminal via Ollama tool calling
-- `voice_to_xdotool.py` / `dotool.sh` — hands-free voice typing into any focused window (VAD + xdotool)
+- `voice_to_xdotool.py` / `xdotool.sh` — hands-free voice typing into any focused window (VAD + xdotool)
+
+## Shared Library
+- `sttlib/` — shared package used by all scripts and importable by other projects
+  - `whisper_loader.py` — model loading with GPU→CPU fallback
+  - `audio.py` — press-enter recording, PCM conversion
+  - `transcription.py` — Whisper transcribe wrapper, hallucination filter
+  - `vad.py` — VADProcessor, audio callback, constants
+- Other projects import via: `sys.path.insert(0, "/path/to/tool-speechtotext")`
+
 ## Testing
-- To test scripts: `mamba run -n whisper-ollama python <script.py> --model-size base`
+- Run tests: `mamba run -n whisper-ollama python -m pytest tests/`
 - Use `--model-size base` for faster iteration during development
+- Tests mock hardware (Whisper model, VAD, mic) — no GPU/mic needed to run them
 - Audio device is available — live mic testing is possible
 - Test xdotool output by focusing a text editor window

@@ -24,12 +33,13 @@ Speech-to-text command line utilities leveraging local models (faster-whisper, O
 - Conda: faster-whisper, sounddevice, numpy, pyperclip, requests, ollama
 - Pip (in conda env): webrtcvad
 - System: libportaudio2, xdotool
+- Dev: pytest
 
 ## Conventions
 - Shell wrappers go in .sh files using `mamba run -n whisper-ollama`
-- All scripts set `CT2_CUDA_ALLOW_FP16=1`
+- Shared code lives in `sttlib/` — scripts are thin entry points that import from it
 - Whisper model loading always has GPU (cuda/float16) -> CPU (cpu/int8) fallback
-- Keep scripts self-contained (no shared module)
+- `CT2_CUDA_ALLOW_FP16=1` is set by `sttlib.whisper_loader` at import time
 - Don't print output for non-actionable events
 
 ## Preferences
@@ -1,57 +1,17 @@
|
|||||||
import sounddevice as sd
|
import argparse
|
||||||
import numpy as np
|
|
||||||
import pyperclip
|
import pyperclip
|
||||||
import requests
|
import requests
|
||||||
import sys
|
from sttlib import load_whisper_model, record_until_enter, transcribe
|
||||||
import argparse
|
|
||||||
from faster_whisper import WhisperModel
|
|
||||||
|
|
||||||
import os
|
|
||||||
os.environ["CT2_CUDA_ALLOW_FP16"] = "1"
|
|
||||||
|
|
||||||
# --- Configuration ---
|
# --- Configuration ---
|
||||||
MODEL_SIZE = "medium" # Options: "base", "small", "medium", "large-v3"
|
OLLAMA_URL = "http://localhost:11434/api/generate"
|
||||||
OLLAMA_URL = "http://localhost:11434/api/generate" # Default is 11434
|
|
||||||
DEFAULT_OLLAMA_MODEL = "qwen3:latest"
|
DEFAULT_OLLAMA_MODEL = "qwen3:latest"
|
||||||
|
|
||||||
# Load Whisper on GPU
|
|
||||||
# float16 is faster and uses less VRAM on NVIDIA cards
|
|
||||||
print("Loading Whisper model...")
|
|
||||||
|
|
||||||
|
|
||||||
try:
|
|
||||||
model = WhisperModel(MODEL_SIZE, device="cuda", compute_type="float16")
|
|
||||||
except Exception as e:
|
|
||||||
print(f"Error loading GPU: {e}")
|
|
||||||
print("Falling back to CPU (Check your CUDA/cuDNN installation)")
|
|
||||||
model = WhisperModel(MODEL_SIZE, device="cuda", compute_type="int16")
|
|
||||||
|
|
||||||
|
|
||||||
def record_audio():
|
|
||||||
fs = 16000
|
|
||||||
print("\n[READY] Press Enter to START recording...")
|
|
||||||
input()
|
|
||||||
print("[RECORDING] Press Enter to STOP...")
|
|
||||||
|
|
||||||
recording = []
|
|
||||||
|
|
||||||
def callback(indata, frames, time, status):
|
|
||||||
if status:
|
|
||||||
```diff
-            print(status, file=sys.stderr)
-        recording.append(indata.copy())
-
-    with sd.InputStream(samplerate=fs, channels=1, callback=callback):
-        input()
-
-    return np.concatenate(recording, axis=0)
-
-
 def main():
-    # 1. Setup Parser
     print(f"System active. Model: {DEFAULT_OLLAMA_MODEL}")
     parser = argparse.ArgumentParser(description="Whisper + Ollama CLI")

-    # Known Arguments (Hardcoded logic)
     parser.add_argument("--nollm", "-n", action='store_true',
                         help="turn off llm")
     parser.add_argument("--system", "-s", default=None,
@@ -65,30 +25,27 @@ def main():
     parser.add_argument(
         "--temp", default='0.7', help="temperature")

-    # 2. Capture "Unknown" arguments
-    # args = known values, unknown = a list like ['--num_ctx', '4096', '--temp', '0.7']
     args, unknown = parser.parse_known_args()

     # Convert unknown list to a dictionary for the Ollama 'options' field
-    # This logic pairs ['--key', 'value'] into {key: value}
     extra_options = {}
     for i in range(0, len(unknown), 2):
-        key = unknown[i].lstrip('-')  # remove the '--'
+        key = unknown[i].lstrip('-')
         val = unknown[i+1]
-        # Try to convert numbers to actual ints/floats
         try:
             val = float(val) if '.' in val else int(val)
         except ValueError:
             pass
         extra_options[key] = val

+    model = load_whisper_model(args.model_size)
+
     while True:
         try:
-            audio_data = record_audio()
+            audio_data = record_until_enter()

             print("[TRANSCRIBING]...")
-            segments, _ = model.transcribe(audio_data.flatten(), beam_size=5)
-            text = "".join([segment.text for segment in segments]).strip()
+            text = transcribe(model, audio_data.flatten())

             if not text:
                 print("No speech detected. Try again.")
@@ -97,8 +54,7 @@ def main():
             print(f"You said: {text}")
             pyperclip.copy(text)

-            if (args.nollm == False):
-                # Send to Ollama
+            if not args.nollm:
                 print(f"[OLLAMA] Thinking...")
                 payload = {
                     "model": args.ollama_model,
@@ -108,9 +64,9 @@ def main():
                 }

                 if args.system:
-                    payload["system"] = args
-                response = requests.post(OLLAMA_URL, json=payload)
+                    payload["system"] = args.system
+
+                response = requests.post(OLLAMA_URL, json=payload)
                 result = response.json().get("response", "")
                 print(f"\nLLM Response:\n{result}\n")
             else:
```
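The hunk above forwards unrecognized CLI flags into Ollama's `options` field by pairing the `unknown` list from `parse_known_args`. A minimal, self-contained sketch of that pairing logic (the flag names here are illustrative, not from the source):

```python
import argparse

# Illustrative CLI mirroring the pairing logic in the diff above:
# unrecognized flags such as --num_ctx 4096 land in `unknown` and are
# paired into an options dict, with numeric strings coerced to int/float.
parser = argparse.ArgumentParser()
parser.add_argument("--nollm", "-n", action="store_true")

args, unknown = parser.parse_known_args(
    ["--nollm", "--num_ctx", "4096", "--temp", "0.7"]
)

extra_options = {}
for i in range(0, len(unknown), 2):
    key = unknown[i].lstrip("-")
    val = unknown[i + 1]
    try:
        # "4096" -> 4096, "0.7" -> 0.7; non-numeric values stay strings
        val = float(val) if "." in val else int(val)
    except ValueError:
        pass
    extra_options[key] = val
```

Here `extra_options` comes out as `{'num_ctx': 4096, 'temp': 0.7}`. Note that an odd-length `unknown` list (a trailing flag with no value) would raise `IndexError`, so the loop assumes well-formed `--key value` pairs.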
|
|||||||
7
python/tool-speechtotext/sttlib/__init__.py
Normal file
7
python/tool-speechtotext/sttlib/__init__.py
Normal file
@@ -0,0 +1,7 @@
|
|||||||
|
from sttlib.whisper_loader import load_whisper_model
|
||||||
|
from sttlib.audio import record_until_enter, pcm_bytes_to_float32
|
||||||
|
from sttlib.transcription import transcribe, is_hallucination, HALLUCINATION_PATTERNS
|
||||||
|
from sttlib.vad import (
|
||||||
|
VADProcessor, audio_callback, audio_queue,
|
||||||
|
SAMPLE_RATE, CHANNELS, FRAME_DURATION_MS, FRAME_SIZE, MIN_UTTERANCE_FRAMES,
|
||||||
|
)
|
||||||
python/tool-speechtotext/sttlib/audio.py (new file, 28 lines):

```python
import sys
import numpy as np
import sounddevice as sd


def record_until_enter(sample_rate=16000):
    """Record audio until user presses Enter. Returns float32 numpy array."""
    print("\n[READY] Press Enter to START recording...")
    input()
    print("[RECORDING] Press Enter to STOP...")

    recording = []

    def callback(indata, frames, time, status):
        if status:
            print(status, file=sys.stderr)
        recording.append(indata.copy())

    with sd.InputStream(samplerate=sample_rate, channels=1, callback=callback):
        input()

    return np.concatenate(recording, axis=0)


def pcm_bytes_to_float32(pcm_bytes):
    """Convert raw 16-bit PCM bytes to float32 array normalized to [-1, 1]."""
    audio_int16 = np.frombuffer(pcm_bytes, dtype=np.int16)
    return audio_int16.astype(np.float32) / 32768.0
```
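As a cross-check of the int16 normalization in `pcm_bytes_to_float32`, a pure-Python equivalent (stdlib only, no NumPy; the function name here is illustrative) behaves identically on individual samples:

```python
import struct

def pcm_to_floats(pcm_bytes):
    # Pure-Python stand-in for sttlib.audio.pcm_bytes_to_float32:
    # unpack little-endian int16 samples, then scale by 1/32768.
    count = len(pcm_bytes) // 2
    samples = struct.unpack("<%dh" % count, pcm_bytes)
    return [s / 32768.0 for s in samples]

half = pcm_to_floats(struct.pack("<h", 16384))    # half-scale sample
floor = pcm_to_floats(struct.pack("<h", -32768))  # most negative sample
```

Dividing by 32768 means +32767 maps to just under 1.0 while -32768 maps exactly to -1.0, the usual asymmetry of int16 PCM (and what the test file below checks).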
python/tool-speechtotext/sttlib/transcription.py (new file, 19 lines):

```python
HALLUCINATION_PATTERNS = [
    "thank you", "thanks for watching", "subscribe",
    "bye", "the end", "thank you for watching",
    "please subscribe", "like and subscribe",
]


def transcribe(model, audio_float32):
    """Transcribe audio using Whisper. Returns stripped text."""
    segments, _ = model.transcribe(audio_float32, beam_size=5)
    return "".join(segment.text for segment in segments).strip()


def is_hallucination(text):
    """Return True if text looks like a Whisper hallucination."""
    lowered = text.lower().strip()
    if len(lowered) < 3:
        return True
    return any(p in lowered for p in HALLUCINATION_PATTERNS)
```
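The hallucination filter is a plain substring check over lowercased text. Reproducing its logic inline (same patterns as the module above) shows the two rejection paths, too-short text and known filler phrases:

```python
# Same patterns and logic as sttlib.transcription above, inlined so the
# example runs without the package on the path.
HALLUCINATION_PATTERNS = [
    "thank you", "thanks for watching", "subscribe",
    "bye", "the end", "thank you for watching",
    "please subscribe", "like and subscribe",
]

def is_hallucination(text):
    lowered = text.lower().strip()
    if len(lowered) < 3:          # reject very short transcripts outright
        return True
    return any(p in lowered for p in HALLUCINATION_PATTERNS)

verdicts = {
    "THANK YOU": is_hallucination("THANK YOU"),                     # pattern match
    "hi": is_hallucination("hi"),                                   # too short
    "open the file manager": is_hallucination("open the file manager"),
}
```

Because the match is substring-based, genuine sentences containing "subscribe" are also dropped; the test suite below pins this down as deliberate behavior in `test_substring_match`.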
python/tool-speechtotext/sttlib/vad.py (new file, 58 lines):

```python
import sys
import queue
import collections
import webrtcvad

SAMPLE_RATE = 16000
CHANNELS = 1
FRAME_DURATION_MS = 30
FRAME_SIZE = int(SAMPLE_RATE * FRAME_DURATION_MS / 1000)  # 480 samples
MIN_UTTERANCE_FRAMES = 10  # ~300ms minimum to filter coughs/clicks

audio_queue = queue.Queue()


def audio_callback(indata, frames, time_info, status):
    """sounddevice callback that pushes raw bytes to the audio queue."""
    if status:
        print(status, file=sys.stderr)
    audio_queue.put(bytes(indata))


class VADProcessor:
    def __init__(self, aggressiveness, silence_threshold):
        self.vad = webrtcvad.Vad(aggressiveness)
        self.silence_threshold = silence_threshold
        self.reset()

    def reset(self):
        self.triggered = False
        self.utterance_frames = []
        self.silence_duration = 0.0
        self.pre_buffer = collections.deque(maxlen=10)  # ~300ms pre-roll

    def process_frame(self, frame_bytes):
        """Process one 30ms frame. Returns utterance bytes when complete, else None."""
        is_speech = self.vad.is_speech(frame_bytes, SAMPLE_RATE)

        if not self.triggered:
            self.pre_buffer.append(frame_bytes)
            if is_speech:
                self.triggered = True
                self.silence_duration = 0.0
                self.utterance_frames = list(self.pre_buffer)
                self.utterance_frames.append(frame_bytes)
        else:
            self.utterance_frames.append(frame_bytes)
            if is_speech:
                self.silence_duration = 0.0
            else:
                self.silence_duration += FRAME_DURATION_MS / 1000.0
                if self.silence_duration >= self.silence_threshold:
                    if len(self.utterance_frames) < MIN_UTTERANCE_FRAMES:
                        self.reset()
                        return None
                    result = b"".join(self.utterance_frames)
                    self.reset()
                    return result
        return None
```
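To see the trigger/collect/flush cycle of the state machine above without a microphone or `webrtcvad` installed, here is a simplified, dependency-free sketch: the speech decision is passed in as an argument instead of coming from `webrtcvad.Vad`, and the trigger frame is taken from the pre-roll buffer only once.

```python
import collections

FRAME_DURATION_MS = 30        # mirrors the constants in sttlib.vad
MIN_UTTERANCE_FRAMES = 10

class MiniVAD:
    """Dependency-free sketch of the VADProcessor state machine."""

    def __init__(self, silence_threshold):
        self.silence_threshold = silence_threshold
        self.reset()

    def reset(self):
        self.triggered = False
        self.utterance_frames = []
        self.silence_duration = 0.0
        self.pre_buffer = collections.deque(maxlen=10)   # ~300ms pre-roll

    def process_frame(self, frame_bytes, is_speech):
        if not self.triggered:
            self.pre_buffer.append(frame_bytes)
            if is_speech:                    # trigger: seed with pre-roll
                self.triggered = True
                self.silence_duration = 0.0
                self.utterance_frames = list(self.pre_buffer)
            return None
        self.utterance_frames.append(frame_bytes)
        if is_speech:
            self.silence_duration = 0.0      # speech resets the silence clock
            return None
        self.silence_duration += FRAME_DURATION_MS / 1000.0
        if self.silence_duration >= self.silence_threshold:
            if len(self.utterance_frames) < MIN_UTTERANCE_FRAMES:
                self.reset()                 # too short: discard (cough/click)
                return None
            utterance = b"".join(self.utterance_frames)
            self.reset()
            return utterance
        return None

proc = MiniVAD(silence_threshold=0.3)
flushed = None
for _ in range(15):                          # voiced frames: trigger + collect
    proc.process_frame(b"s", is_speech=True)
for _ in range(20):                          # silence: flush after ~0.3 s
    out = proc.process_frame(b"q", is_speech=False)
    if out is not None:
        flushed = out
        break
```

The flushed utterance contains both the voiced frames and the trailing silence frames, and the processor resets itself afterwards, which is exactly what `test_reset_after_utterance` below exercises against the real class.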
python/tool-speechtotext/sttlib/whisper_loader.py (new file, 15 lines):

```python
import os
from faster_whisper import WhisperModel

os.environ["CT2_CUDA_ALLOW_FP16"] = "1"


def load_whisper_model(model_size):
    """Load Whisper with GPU (cuda/float16) -> CPU (cpu/int8) fallback."""
    print(f"Loading Whisper model ({model_size})...")
    try:
        return WhisperModel(model_size, device="cuda", compute_type="float16")
    except Exception as e:
        print(f"GPU loading failed: {e}")
        print("Falling back to CPU (int8)")
        return WhisperModel(model_size, device="cpu", compute_type="int8")
```
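The loader's GPU-then-CPU strategy is just a try/except around two constructor calls. A generic, runnable sketch of that shape with stubbed loaders (all names here are illustrative):

```python
def load_with_fallback(primary, fallback):
    # Same shape as load_whisper_model above: attempt the preferred
    # backend, fall back to the slower one on any exception.
    try:
        return primary()
    except Exception as exc:
        print(f"primary load failed: {exc}; falling back")
        return fallback()

def fake_gpu_loader():
    raise RuntimeError("no CUDA device")   # simulated GPU failure

def fake_cpu_loader():
    return "cpu-int8-model"                # simulated model handle

model = load_with_fallback(fake_gpu_loader, fake_cpu_loader)
```

One caveat visible in the real loader: a bare `except Exception` also swallows unrelated errors such as a misspelled model name, in which case the CPU attempt will fail with the same error (the `test_both_fail_propagates` test below covers that path).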
python/tool-speechtotext/tests/__init__.py (new file, empty)

python/tool-speechtotext/tests/test_audio.py (new file, 38 lines):

```python
import struct
import numpy as np
from sttlib.audio import pcm_bytes_to_float32


def test_known_value():
    # 16384 in int16 -> 0.5 in float32
    pcm = struct.pack("<h", 16384)
    result = pcm_bytes_to_float32(pcm)
    assert abs(result[0] - 0.5) < 1e-5


def test_silence():
    pcm = b"\x00\x00" * 10
    result = pcm_bytes_to_float32(pcm)
    assert np.all(result == 0.0)


def test_full_scale():
    # max int16 = 32767 -> ~1.0
    pcm = struct.pack("<h", 32767)
    result = pcm_bytes_to_float32(pcm)
    assert abs(result[0] - (32767 / 32768.0)) < 1e-5


def test_negative():
    # min int16 = -32768 -> -1.0
    pcm = struct.pack("<h", -32768)
    result = pcm_bytes_to_float32(pcm)
    assert result[0] == -1.0


def test_round_trip_shape():
    # 100 samples worth of bytes
    pcm = b"\x00\x00" * 100
    result = pcm_bytes_to_float32(pcm)
    assert result.shape == (100,)
    assert result.dtype == np.float32
```
python/tool-speechtotext/tests/test_transcription.py (new file, 78 lines):

```python
from unittest.mock import MagicMock
from sttlib.transcription import transcribe, is_hallucination


# --- is_hallucination tests ---

def test_known_hallucinations():
    assert is_hallucination("Thank you")
    assert is_hallucination("thanks for watching")
    assert is_hallucination("Subscribe")
    assert is_hallucination("the end")


def test_short_text():
    assert is_hallucination("hi")
    assert is_hallucination("")
    assert is_hallucination("a")


def test_normal_text():
    assert not is_hallucination("Hello how are you")
    assert not is_hallucination("Please open the terminal")


def test_case_insensitivity():
    assert is_hallucination("THANK YOU")
    assert is_hallucination("Thank You For Watching")


def test_substring_match():
    assert is_hallucination("I want to subscribe to your channel")


def test_exactly_three_chars():
    assert not is_hallucination("hey")


# --- transcribe tests ---

def _make_segment(text):
    seg = MagicMock()
    seg.text = text
    return seg


def test_transcribe_joins_segments():
    model = MagicMock()
    model.transcribe.return_value = (
        [_make_segment("Hello "), _make_segment("world")],
        None,
    )
    result = transcribe(model, MagicMock())
    assert result == "Hello world"


def test_transcribe_empty():
    model = MagicMock()
    model.transcribe.return_value = ([], None)
    result = transcribe(model, MagicMock())
    assert result == ""


def test_transcribe_strips_whitespace():
    model = MagicMock()
    model.transcribe.return_value = (
        [_make_segment(" hello ")],
        None,
    )
    result = transcribe(model, MagicMock())
    assert result == "hello"


def test_transcribe_passes_beam_size():
    model = MagicMock()
    model.transcribe.return_value = ([], None)
    audio = MagicMock()
    transcribe(model, audio)
    model.transcribe.assert_called_once_with(audio, beam_size=5)
```
python/tool-speechtotext/tests/test_vad.py (new file, 151 lines):

```python
from unittest.mock import patch, MagicMock
from sttlib.vad import VADProcessor, FRAME_DURATION_MS, MIN_UTTERANCE_FRAMES


def _make_vad_processor(aggressiveness=3, silence_threshold=0.8):
    """Create VADProcessor with a mocked webrtcvad.Vad."""
    with patch("sttlib.vad.webrtcvad.Vad") as mock_vad_cls:
        mock_vad = MagicMock()
        mock_vad_cls.return_value = mock_vad
        proc = VADProcessor(aggressiveness, silence_threshold)
    return proc, mock_vad


def _frame(label="x"):
    """Return a fake 30ms frame (just needs to be distinct bytes)."""
    return label.encode() * 960  # 480 samples * 2 bytes


def test_no_speech_returns_none():
    proc, mock_vad = _make_vad_processor()
    mock_vad.is_speech.return_value = False

    for _ in range(100):
        assert proc.process_frame(_frame()) is None


def test_speech_then_silence_triggers_utterance():
    proc, mock_vad = _make_vad_processor(silence_threshold=0.3)

    # Feed enough speech frames
    speech_count = MIN_UTTERANCE_FRAMES + 5
    mock_vad.is_speech.return_value = True
    for _ in range(speech_count):
        result = proc.process_frame(_frame("s"))
        assert result is None  # not done yet

    # Feed silence frames until threshold (0.3s = 10 frames at 30ms)
    mock_vad.is_speech.return_value = False
    result = None
    for _ in range(20):
        result = proc.process_frame(_frame("q"))
        if result is not None:
            break

    assert result is not None
    assert len(result) > 0


def test_short_utterance_filtered():
    # Use very short silence threshold so silence frames don't push total
    # past MIN_UTTERANCE_FRAMES. With threshold=0.09s (3 frames of silence):
    # 0 pre-buffer + 1 speech + 3 silence = 4 total < MIN_UTTERANCE_FRAMES (10)
    proc, mock_vad = _make_vad_processor(silence_threshold=0.09)

    # Single speech frame triggers VAD
    mock_vad.is_speech.return_value = True
    proc.process_frame(_frame("s"))

    # Immediately go silent; threshold reached in 3 frames
    mock_vad.is_speech.return_value = False
    result = None
    for _ in range(20):
        result = proc.process_frame(_frame("q"))
        if result is not None:
            break

    # Should be filtered (too short: only 4 total frames)
    assert result is None


def test_pre_buffer_included():
    proc, mock_vad = _make_vad_processor(silence_threshold=0.3)

    # Fill pre-buffer with non-speech frames
    mock_vad.is_speech.return_value = False
    pre_frame = _frame("p")
    for _ in range(10):
        proc.process_frame(pre_frame)

    # Speech starts
    mock_vad.is_speech.return_value = True
    speech_frame = _frame("s")
    for _ in range(MIN_UTTERANCE_FRAMES):
        proc.process_frame(speech_frame)

    # Silence to trigger
    mock_vad.is_speech.return_value = False
    result = None
    for _ in range(20):
        result = proc.process_frame(_frame("q"))
        if result is not None:
            break

    assert result is not None
    # Result should contain pre-buffer frames
    assert pre_frame in result


def test_reset_after_utterance():
    proc, mock_vad = _make_vad_processor(silence_threshold=0.3)

    # First utterance
    mock_vad.is_speech.return_value = True
    for _ in range(MIN_UTTERANCE_FRAMES + 5):
        proc.process_frame(_frame("s"))

    mock_vad.is_speech.return_value = False
    for _ in range(20):
        result = proc.process_frame(_frame("q"))
        if result is not None:
            break
    assert result is not None

    # After reset, should be able to collect a second utterance
    assert not proc.triggered
    assert proc.utterance_frames == []

    mock_vad.is_speech.return_value = True
    for _ in range(MIN_UTTERANCE_FRAMES + 5):
        proc.process_frame(_frame("s"))

    mock_vad.is_speech.return_value = False
    result2 = None
    for _ in range(20):
        result2 = proc.process_frame(_frame("q"))
        if result2 is not None:
            break
    assert result2 is not None


def test_silence_threshold_boundary():
    # Use 0.3s threshold: 0.3 / 0.03 = exactly 10 frames needed
    threshold = 0.3
    proc, mock_vad = _make_vad_processor(silence_threshold=threshold)

    # Start with speech
    mock_vad.is_speech.return_value = True
    for _ in range(MIN_UTTERANCE_FRAMES + 5):
        proc.process_frame(_frame("s"))

    frames_needed = 10  # 0.3s / 0.03s per frame
    mock_vad.is_speech.return_value = False

    # Feed one less than needed; should NOT trigger
    for i in range(frames_needed - 1):
        result = proc.process_frame(_frame("q"))
        assert result is None, f"Triggered too early at frame {i}"

    # The 10th frame should trigger (silence_duration = 0.3 >= 0.3)
    result = proc.process_frame(_frame("q"))
    assert result is not None
```
python/tool-speechtotext/tests/test_whisper_loader.py (new file, 37 lines):

```python
from unittest.mock import patch, MagicMock
from sttlib.whisper_loader import load_whisper_model


@patch("sttlib.whisper_loader.WhisperModel")
def test_gpu_success(mock_cls):
    mock_model = MagicMock()
    mock_cls.return_value = mock_model

    result = load_whisper_model("base")

    assert result is mock_model
    mock_cls.assert_called_once_with("base", device="cuda", compute_type="float16")


@patch("sttlib.whisper_loader.WhisperModel")
def test_gpu_fails_cpu_fallback(mock_cls):
    mock_model = MagicMock()
    mock_cls.side_effect = [RuntimeError("no CUDA"), mock_model]

    result = load_whisper_model("base")

    assert result is mock_model
    assert mock_cls.call_count == 2
    _, kwargs = mock_cls.call_args
    assert kwargs == {"device": "cpu", "compute_type": "int8"}


@patch("sttlib.whisper_loader.WhisperModel")
def test_both_fail_propagates(mock_cls):
    mock_cls.side_effect = RuntimeError("no device")

    try:
        load_whisper_model("base")
        assert False, "Should have raised"
    except RuntimeError:
        pass
```
```diff
@@ -1,31 +1,16 @@
-import sounddevice as sd
-import numpy as np
-import pyperclip
-import sys
 import argparse
-import os
 import subprocess
-import ollama
 import json
-from faster_whisper import WhisperModel
+import ollama
+from sttlib import load_whisper_model, record_until_enter, transcribe

 # --- Configuration ---
-os.environ["CT2_CUDA_ALLOW_FP16"] = "1"
-MODEL_SIZE = "medium"
 OLLAMA_MODEL = "qwen2.5-coder:7b"
 CONFIRM_COMMANDS = True  # Set to False to run commands instantly

-# Load Whisper on GPU
-print("Loading Whisper model...")
-try:
-    model = WhisperModel(MODEL_SIZE, device="cuda", compute_type="float16")
-except Exception as e:
-    print(f"Error loading GPU: {e}, falling back to CPU")
-    model = WhisperModel(MODEL_SIZE, device="cpu", compute_type="int8")

 # --- Terminal Tool ---
 def run_terminal_command(command: str):
     """
     Executes a bash command in the Linux terminal.
@@ -33,8 +18,7 @@ def run_terminal_command(command: str):
     """
     if CONFIRM_COMMANDS:
         print(f"\n{'='*40}")
         print(f"⚠️ AI SUGGESTED: \033[1;32m{command}\033[0m")
-        # Allow user to provide feedback if they say 'n'
         choice = input(" Confirm? [Y/n] or provide feedback: ").strip()

         if choice.lower() == 'n':
@@ -57,22 +41,15 @@ def run_terminal_command(command: str):
         return f"Execution Error: {str(e)}"


-def record_audio():
-    fs, recording = 16000, []
-    print("\n[READY] Press Enter to START...")
-    input()
-    print("[RECORDING] Press Enter to STOP...")
-    def cb(indata, f, t, s): recording.append(indata.copy())
-    with sd.InputStream(samplerate=fs, channels=1, callback=cb):
-        input()
-    return np.concatenate(recording, axis=0)
-
-
 def main():
     parser = argparse.ArgumentParser()
     parser.add_argument("--model", default=OLLAMA_MODEL)
+    parser.add_argument("--model-size", default="medium",
+                        help="Whisper model size")
     args, _ = parser.parse_known_args()

+    whisper_model = load_whisper_model(args.model_size)
+
     # Initial System Prompt
     messages = [{
         'role': 'system',
@@ -88,9 +65,8 @@ def main():
     while True:
         try:
             # 1. Voice Capture
-            audio_data = record_audio()
-            segments, _ = model.transcribe(audio_data.flatten(), beam_size=5)
-            user_text = "".join([s.text for s in segments]).strip()
+            audio_data = record_until_enter()
+            user_text = transcribe(whisper_model, audio_data.flatten())
             if not user_text:
                 continue
```
```diff
@@ -1,91 +1,14 @@
-import sounddevice as sd
-import numpy as np
-import webrtcvad
-import subprocess
-import sys
-import os
 import argparse
+import subprocess
 import threading
 import queue
-import collections
 import time
-from faster_whisper import WhisperModel
+import sounddevice as sd
+from sttlib import (
+    load_whisper_model, transcribe, is_hallucination, pcm_bytes_to_float32,
+    VADProcessor, audio_callback, audio_queue,
+    SAMPLE_RATE, CHANNELS, FRAME_SIZE,
+)

-os.environ["CT2_CUDA_ALLOW_FP16"] = "1"
-
-# --- Constants ---
-SAMPLE_RATE = 16000
-CHANNELS = 1
-FRAME_DURATION_MS = 30
-FRAME_SIZE = int(SAMPLE_RATE * FRAME_DURATION_MS / 1000)  # 480 samples
-MIN_UTTERANCE_FRAMES = 10  # ~300ms minimum to filter coughs/clicks
-
-HALLUCINATION_PATTERNS = [
-    "thank you", "thanks for watching", "subscribe",
-    "bye", "the end", "thank you for watching",
-    "please subscribe", "like and subscribe",
-]
-
-# --- Thread-safe audio queue ---
-audio_queue = queue.Queue()
-
-
-def audio_callback(indata, frames, time_info, status):
-    if status:
-        print(status, file=sys.stderr)
-    audio_queue.put(bytes(indata))
-
-
-# --- Whisper model loading (reused pattern from assistant.py) ---
-def load_whisper_model(model_size):
-    print(f"Loading Whisper model ({model_size})...")
-    try:
-        return WhisperModel(model_size, device="cuda", compute_type="float16")
-    except Exception as e:
-        print(f"GPU loading failed: {e}")
-        print("Falling back to CPU (int8)")
-        return WhisperModel(model_size, device="cpu", compute_type="int8")
-
-
-# --- VAD State Machine ---
-class VADProcessor:
-    def __init__(self, aggressiveness, silence_threshold):
-        self.vad = webrtcvad.Vad(aggressiveness)
-        self.silence_threshold = silence_threshold
-        self.reset()
-
-    def reset(self):
-        self.triggered = False
-        self.utterance_frames = []
-        self.silence_duration = 0.0
-        self.pre_buffer = collections.deque(maxlen=10)  # ~300ms pre-roll
-
-    def process_frame(self, frame_bytes):
-        """Process one 30ms frame. Returns utterance bytes when complete, else None."""
-        is_speech = self.vad.is_speech(frame_bytes, SAMPLE_RATE)
-
-        if not self.triggered:
-            self.pre_buffer.append(frame_bytes)
-            if is_speech:
-                self.triggered = True
-                self.silence_duration = 0.0
-                self.utterance_frames = list(self.pre_buffer)
-                self.utterance_frames.append(frame_bytes)
-            pass  # silent until transcription confirms speech
-        else:
-            self.utterance_frames.append(frame_bytes)
-            if is_speech:
-                self.silence_duration = 0.0
-            else:
-                self.silence_duration += FRAME_DURATION_MS / 1000.0
-                if self.silence_duration >= self.silence_threshold:
-                    if len(self.utterance_frames) < MIN_UTTERANCE_FRAMES:
-                        self.reset()
-                        return None
-                    result = b"".join(self.utterance_frames)
-                    self.reset()
-                    return result
-        return None
-
-
 # --- Typer Interface (xdotool) ---
@@ -99,6 +22,7 @@ class Typer:
         except FileNotFoundError:
             print("ERROR: xdotool not found. Install it:")
             print("  sudo apt-get install xdotool")
+            import sys
             sys.exit(1)

     def type_text(self, text, submit_now=False):
@@ -120,24 +44,6 @@ class Typer:
         pass


-# --- Helpers ---
-def pcm_bytes_to_float32(pcm_bytes):
-    audio_int16 = np.frombuffer(pcm_bytes, dtype=np.int16)
-    return audio_int16.astype(np.float32) / 32768.0
-
-
-def transcribe(model, audio_float32):
-    segments, _ = model.transcribe(audio_float32, beam_size=5)
-    return "".join(segment.text for segment in segments).strip()
-
-
-def is_hallucination(text):
-    lowered = text.lower().strip()
-    if len(lowered) < 3:
-        return True
-    return any(p in lowered for p in HALLUCINATION_PATTERNS)
-
-
 # --- CLI ---
 def parse_args():
     parser = argparse.ArgumentParser(
```