Train local LLMs on real enterprise UIs, then let product managers vibe-code production interfaces. Zero API costs. One GPU. Future-proof for AI agents.
View on GitHubCapture real enterprise UIs — banking dashboards, workflow forms, transaction screens — with Playwright. Feed them into a local 14B model and fine-tune with QLoRA. Now your team has an AI assistant that understands your design system and it never leaves your network.
Instead of one mockup per review cycle, generate 20 UI variations in the time it takes to write one ticket. PMs describe what they need — “a transaction history table with filters and export” — and explore options instantly. Designers focus on the hard UX problems, not pixel-pushing.
The idea-to-visual gap goes from days to seconds. Stakeholders see working HTML prototypes instead of static mockups. Iterate in real-time during the meeting, not after. The team spends less time waiting and more time making decisions that matter.
When AI handles the repetitive UI scaffolding, your designers focus on UX strategy, accessibility, and complex interactions. Your developers build business logic, not boilerplate. The goal isn’t fewer people — it’s higher-leverage work from the same team.
| Without | With Enterprise Playground |
|---|---|
| 1 mockup per review cycle | 20+ variations before the meeting ends |
| Idea → visual: days of handoff | Idea → working HTML: seconds |
| Designers pixel-push boilerplate | Designers focus on UX strategy & hard problems |
| Developers write UI scaffolding | Developers build business logic & integrations |
| Per-token cloud API fees | Unlimited local inference at $0/token |
| Feedback loops span multiple sprints | Iterate live, ship the same day |
14B code generator + 3B router running simultaneously on one GPU (10.5 GB VRAM)
Real-time HTML generation streamed token-by-token to the browser
ChromaDB + nomic-embed-text embeddings (CPU-only, zero VRAM) enrich prompts with domain context
SequenceMatcher-based dedup — identical prompts return instantly, saving 100% of tokens
Keyword + LLM classifier routes requests to the optimal model (3B for text, 14B for code)
Interactive 2D/3D UMAP projections of RAG embeddings via Plotly.js with click-to-inspect
Chunk browser, type distributions, per-workflow analytics, similarity search
LoRA r=32 training pipeline with dataset preparation, training, and Ollama deployment
Playwright-based capture of banking workflow UIs with full-page screenshots
User Prompt
|
v
+--------------+
| Smart Router | <-- qwen2.5:3b (keyword + LLM classification)
| (3B) |
+------+-------+
|
+----+----+
| |
Code Text
| |
v v
+------+ +------+
|Cache | | 3B |
|Check | |Direct|
+--+---+ +------+
|
HIT| MISS
| |
| v
| +----------+
| |RAG Query | <-- ChromaDB + nomic-embed-text (CPU)
| +----+-----+
| v
| +----------+
| |3B Compress| <-- Saves 30-50% input tokens
| +----+-----+
| v
| +----------+
| |14B Gen | <-- qwen2.5-coder:14b (SSE streaming)
| +----+-----+
| v
| +----------+
| |Cache Store|
+>|+ Save HTML|
+----------+
| Model | Role | VRAM | Context |
|---|---|---|---|
qwen2.5-coder:14b | HTML/CSS/JS generation | ~8.5 GB | 8192 tokens |
qwen2.5:3b | Routing, chat, compression | ~2.0 GB | 2048 tokens |
nomic-embed-text | RAG embeddings | 0 GB (CPU) | — |
| Total | ~10.5 GB | Leaves 5.5 GB for KV cache |
SSE streaming, prompt input, style selector, RAG context panel
Live iframe previews, search/filter/sort, CACHE and RAG badges
7-phase ML pipeline: Scrape > Map > Store > Route > Generate > Cache > Train
RAG ingest/clear/query tester, workflow browser, dataset prep
Model comparison (14B vs 3B), VRAM gauge, cache rate, activity log
RAG chunking, training lifecycle, adapter registry, pipeline diagram
Trace timeline, model distribution, router methods, token economy
UMAP 2D/3D scatter (Plotly.js), ChromaDB inspector, storage map
Python 3.11+ / FastAPI / Ollama / ChromaDB / UMAP / Plotly.js / Playwright / PyTorch + PEFT / SQLite
Next.js 15 / TypeScript strict / Tailwind CSS + shadcn/ui / Zustand / TanStack Query / Zod
Vitest for unit tests / Playwright for E2E / Storybook for component development