
DEV Community

# llm

Posts

Why I built ragwise: pip-installable RAG with hybrid search, streaming, and agent tools by default

Comments
4 min read
Stop Paying for the Same Answer Twice: A Deep Dive into llm-cache

3 reactions
Comments
9 min read
Running LLM Classification After the Response: Next.js after() + OpenRouter at $0.0002 per Call

5 reactions
Comments
8 min read
We Open-Sourced Our Enterprise AI Agent Stack — 6 Libraries From 60+ Deployments.

Comments
9 min read
I Built a 7-Agent Prompt Framework, Then Used It to Debug Its Own Output

Comments
6 min read
Opus 4.7 First Look: I Tested the Day-Old Model Against 3 Other Claudes on 10 Real Tasks

Comments 1
5 min read
When one translation isn't enough: building konid

Comments
2 min read
All Data and AI Weekly #238-20April2026

5 reactions
Comments
11 min read
The 96.3% Is a Trap: What Hermes 4 405B Actually Changed

Comments
8 min read
Local Voice-Controlled AI Agent (Whisper + Ollama + Streamlit)

Comments
2 min read
EcomRLVE-GYM: The real problem for a shopping agent is completing the transaction, not just talking well

Comments
23 min read
I tried to hack my local AI agent with Prompt Injection. It laughed at me.

Comments
4 min read
Stop burning tokens on DOM noise: a Playwright MCP optimizer layer

Comments
2 min read
I Tried Building GPT Without Training — Just Math. Here’s Where It Broke | Shivnath Tathe

Comments
8 min read
Local LLM with Google Gemma: On-Device Inference Between Theory and Practice

Comments
5 min read