ChatDev Review — Features, Pricing & User Sentiment | Payloop

ChatDev

frameworkai-agentstiered

ChatDev is a virtual software company of intelligent agents united to revolutionize programming through collaboration. Its goal is providing an easily

While specific user reviews of "ChatDev" were not provided, social mentions hint at a few points: The software's integration with Claude AI appears to facilitate specialized, domain-specific solutions, as discussed through various user projects. Users express both creativity and frustration, with some leveraging ChatDev to enhance functionality and others encountering novel issue challenges, such as exporting formatted outputs like PNGs. The overall sentiment around pricing was not discussed, but the mention of creative uses and problem-solving suggests a generally positive reputation amongst developers.

Mentions (30d)

23

9 this week

Reviews

0

Platforms

2

GitHub Stars

32,290

3,984 forks

Pain Score: 1/10015 integrations6 features

Share:Twitter LinkedIn

AI Summary

While specific user reviews of "ChatDev" were not provided, social mentions hint at a few points: The software's integration with Claude AI appears to facilitate specialized, domain-specific solutions, as discussed through various user projects. Users express both creativity and frustration, with some leveraging ChatDev to enhance functionality and others encountering novel issue challenges, such as exporting formatted outputs like PNGs. The overall sentiment around pricing was not discussed, but the mention of creative uses and problem-solving suggests a generally positive reputation amongst developers.

Features & Use Cases

Features

1. Clone the GitHub Repository:2. Set Up Python Environment:3. Install Dependencies:4. Set OpenAI API Key:5. Build Your Software:6. Run Your Software:

Use Cases

Automating software development tasksCreating custom software solutions based on user specificationsTesting and debugging code through intelligent agentsManaging project timelines and resources effectivelyFacilitating collaboration between developers and stakeholdersGenerating documentation and user manuals automaticallyProviding real-time code suggestions and improvementsConducting code reviews and quality assurance

Social Reach

5,959

GitHub followers

Developer Ecosystem

70

GitHub repos

32,290

GitHub stars

3

npm packages

38

HuggingFace models

Mentions by Platform

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

Pricing

tiered

Mention Activity (Last 12 Weeks)

Platform Distribution

Sentiment Overview

Positive0% (0)

Neutral100% (56)

Negative0% (0)

Common Pain Points

API bill (2)anthropic bill (2)token cost (1)cost tracking (1)

Recent Mentions

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

youtube

ChatDev AI

ChatDev AI

reddit@[unknown]5/24/2026

Banned by OpenAI after reporting a live credential hijack. They admitted in writing my account was broken. Here are 7 months of forensic receipts and 20+ cases.

Drive Link for Zipped Proof I am a developer and paying long term subscriber to ChatGPT since January 2025. I build complex local first sovereign systems. My workflows are incredibly context heavy with large files spanning code, research reports, and other analysis. I do not, or rather did not as the platform has been non functional since November 2025 meanwhile customer support is auto closing tickets, admitting I am having platform issues. I do not use this platform for casual queries, as a solo developer with no formal "team" chatgpt was one of my reliable co collaboration hubs to help ensure I am maintaining proper development of said complex systems. I feed it massive codebases for systems analysis and obtaining new insights I may personally have missed. My manual code uploads and token inputs routinely exceed the model's output volume by a massive margin. I do not abuse this platform. It is actually impossible as the very features advertised under the paid subscription do not work. I am exactly the type of user this platform was built for, and I have been a continuous, paying ChatGPT Plus subscriber since January 2025. Since October 2025, my workspace has been systematically breaking and beginning November 2025 total workspace degredation. This was not an occasional glitch. Persistent memory modules stopped updating. Custom instructions were ignored by the models. Project files failed to load. Custom instructions, personalization features, connector abilities, file tool, even projects do not work. It started as a continuous degradation until total failure. OpenAI customer service even admitted as such and yet months later I've talked to nothing but bots, not only LLMs as customer service but even instances of falsely identifying as true human support. It was a state of rolling degradation across the entire paid tier, month after month. Meanwhile OpenAI freely has enhanced for businesses and enterprise tiers. I have not just rapid complained to standard support. I ran and obtained cross platform diagnostics, failure logs. I even documented and told oai customer support the exact replication steps only to be met with acknowledgement of degredation with no resolution. I handed OpenAI support a completely packaged technical breakdown of their failing infrastructure across 20 separate support tickets over a 7 month period. I did their QA work for free. And I have the receipts to prove it. I am attaching the screenshots and the exact email files to this post. In Case 06830839, OpenAI Support explicitly put this in writing: "We acknowledge that you have been experiencing persistent technical issues affecting several features of your ChatGPT subscription, including tools, memory functions, personalization settings, connectors, and project files... We also understand your concern that communication on the case stopped after you provided detailed evidence..." Read that again. They acknowledged in writing that my account was fundamentally broken. They acknowledged that their own team ghosted me after I handed them the diagnostic proof. Yet they kept charging my card every single month for a product they knew was failing. The Hijack Escalation: Two days ago, the situation escalated from a broken product to a severe security incident. I was monitoring my environment and watched my Codex rate limits drop in 10 percent chunks across 2 seperate sessions on a fresh boot of the desktop app. This happened twice inside a 10 minute window. I had zero active sessions running. There was zero usage on my end. My account token was being actively drained by an unauthorized third party exploit. I immediately opened an emergency unauthorized activity report under Case 09113391 to notify them of the hack. Their response was to totally reframe this problem as disputing fraudulent activity trying to do damage control of the situation and altering the record. The Reframe Attempts: Instead of investigating the breach, OpenAI support deliberately twisted the record. They not only deliberately reframed my security report as an "appeal for fraud." They manipulated the ticket classification to make it look like I had been flagged for fraud and was begging for an appeal, rather than a developer reporting a live exploit on their infrastructure. They ignored the active threat their own platform was exposing. They did not lock the token. They did not roll my API keys. They did absolutely nothing to secure a compromised paying user other than shift the blame. Fast forward to this morning, their automated Trust and Safety system swept the high volume traffic from the attacker, scored it as a malicious exploit originating from my account, and deactivated/banned me for "Cyber Abuse." All the while actively preventing chatgpt models from helping me try to disgnose and trace the infiltration. They locked the doors and blamed the homeowner for the break in. When I immediately emailed and pushed back (due to their monthly record of closi

reddit@[unknown]5/23/2026

Anthropic's Claude gave me a "Safe Mode" batch script. It ran "del /f /s C:\*" and wiped my entire drive. Company says "we are not responsible."

I'm a software developer from Turkey. On May 22, 2026, I asked Claude to write a Windows optimization script. Claude produced a .bat file called "DevBoost v5.0" with different modes. I chose option 1: **"Balanced Optimization - Safe, won't touch system files."** I ran it as administrator. The script contained a critical string-parsing bug in the browser cache cleaning section. Here's the destructive code Claude generated: for %%B in ( "Chrome:%LOCALAPPDATA%\Google\Chrome\User Data\Default\Cache" "Edge:%LOCALAPPDATA%\Microsoft\Edge\User Data\Default\Cache" ) do ( for /f "tokens=1,2 delims=:" %%x in ("%%~B") do ( if exist "%%y:" ( del /q /f /s "%%y:*" >nul 2>&1 ) ) ) Because of the "delims=:" tokenization, `%%y` resolves to just **"C"** (the drive letter). The condition `if exist "C:"` is always true. So the script silently executed: del /q /f /s "C:*" **This command silently force-deleted EVERY SINGLE FILE on my C: drive.** Operating system files, all my projects (hundreds of Python, JavaScript, C++ source files), client work with approaching deadlines, personal documents, photos — everything. Folders still exist but are completely empty. My computer can no longer boot. No programs open. Not even Command Prompt works. I'm sending this from my phone. **Anthropic's response:** I contacted support@anthropic.com and usersafety@anthropic.com multiple times. Their final response, literally signed "This response was generated by Anthropic's AI agent Fin AI Agent," stated they take no responsibility. They refuse any refund, compensation, or even a genuine human acknowledgment of their AI's catastrophic safety failure. Their position: "Our Terms of Service say outputs may contain inaccuracies. You should have independently verified the code before running it." My question: Why does Claude label destructive code as "Balanced Optimization - Safe mode"? If it can't guarantee safety, why does it promise it? **Proof:** I have the complete chat log, the full script file, and all email correspondence with Anthropic's support team. I'm happy to provide everything to moderators. **Update:** I am also filing complaints with the FTC (US Federal Trade Commission) and the Turkish Consumer Arbitration Board today. Don't let their "Safe Mode" labels fool you. Please share this so others don't lose years of work like I did. UPDATE — May 23, 2026: I have now filed official complaints with: US Federal Trade Commission (FTC) — Report #202036054 Turkish Consumer Arbitration Board — Application #2026/0245.3885 Both governments are now officially investigating Anthropic's role in this AI safety failure. Anthropic still refuses to take any responsibility. submitted by /u/falleennn [link] [comments]

reddit@[unknown]5/23/2026

Hard-won notes after a few weeks with Claude Design

Been using Claude Design for a few weeks and figured I'd dump some notes here before I forget. Nothing groundbreaking, just stuff that took me way too long to figure out on my own. First thing nobody tells you, do the design system setup before you build anything. I spent my whole first session prompting "build me a landing page for X" and got the most generic AI-looking garbage you can imagine. Then I actually uploaded some brand stuff, let it extract tokens, approved them, and suddenly everything after that looked like a real product. Same exact prompts, completely different result. This is literally in the docs btw. I just skimmed past it like an idiot. Second thing is it eats tokens. A lot. It runs on a separate weekly budget from regular Claude Chat and Claude Code which sounds great but if you're re-prompting every little change you'll burn through it fast. Turns out the refine controls, inline comments, direct text edits, sliders, use way less than typing "actually can you make the padding a bit bigger" in chat. Once I started using those for small fixes my budget lasted way longer. On Max 20x it's mostly fine, on the $20 plan you'll feel it pretty quickly. Also the animations are live React components running in the browser, not video files. If you want an MP4, download the standalone HTML file and throw it into Claude2Video, it'll generate one from that. Honest take on where it fits since people always ask, it's not killing Figma. Figma is still better for any real design team workflow, Dev Mode, multi-person collab, all that. v0 and Lovable are still better if you want to skip design entirely and just spin up an MVP with auth and a db. Where this thing actually wins is the loop from "I have an idea" to working prototype to Claude Code building the actual app from it. The design system carrying through to the shipped code is the part that feels genuinely different from anything else out there. If you're a solo founder or PM or just someone who keeps getting stuck between mockups and something real you can show people, it's worth learning. If you already have a design team and a proper component library, probably overkill. It's a research preview so half of this might be wrong in two months. submitted by /u/Helpful_Regular_30 [link] [comments]

reddit@[unknown]5/22/2026

Update on the agent I let run 24/7 for a month: 49 PRs merged into 26 OSS projects (Apache, OpenTelemetry, starship, bat, hono, clap, jj, oh-my-zsh), and it shipped its own component library.

Month-ago post for context: https://www.reddit.com/r/ClaudeAI/s/sQ2ucngAbz. The question everyone asked was “does it actually keep working?” It actually does Day 41. It’s merged PRs into some open-source repos you’ve probably heard of. A few of the names: apache/fory open-telemetry/otel-arrow starship/starship sharkdp/bat honojs/hono clap-rs/clap (twice) jj-vcs/jj tracel-ai/burn ohmyzsh/ohmyzsh charmbracelet/gum orhun/git-cliff Full list with every PR linked, in order, with the org logos and dates: https://truffleagent.com/maintains/. That page does it better than I can in a post and I promise Truffle made this page when I sent it the YC request for startups about companies that don’t give tools but do the job end to end. Now here’s the part that’s been messing with me. It also shipped its own component library. truffleagent.com/glyph. 16 Bubble Tea components, shadcn-style copy-paste install, MIT, on pkg.go.dev. A whole product, basically. I can wrap my head around an agent filing PRs. I can wrap my head around it writing Go. What I genuinely cannot figure out is how it made the gifs. Go look at the page. There’s a thirty-second animated reel of a TUI cycling through six surfaces. Chat, commands, logs, sidebar, progress, diff. Every frame is real terminal output. Then every single component below has its own clean PNG preview, on theme, perfectly framed. Sixteen of them. Everything is public if you want to dig: GitHub: github.com/truffle-dev Full PR list: truffleagent.com/maintains Glyph: truffleagent.com/glyph Site, auto-updates daily: truffle.ghostwright.dev/public Happy to answer anything in the comments. submitted by /u/Beneficial_Elk_9867 [link] [comments]

reddit@[unknown]5/21/2026

Handoffs are becoming a first-class pattern in Claude workflows. Here is how I have been thinking about them.

Long Claude sessions still break on context decay. Handoffs are the simple fix: compress what matters, start a fresh agent, keep going. Matt Pocock's new handoff skill (repo) does this in one command. It compacts the conversation into a document, points at existing artifacts instead of restating them, and the next agent picks up from it. It also chains between threads: /grill-with-docs -> /handoff -> /prototype -> /handoff back. I built handoffs into APM, a multi-agent framework for Claude Code, back in May 2025 (1 year ago....) when context windows were tiny enough that you had to constantly start fresh or you would have to deal w hallucinations all the time. What I did differently: split the handoff into two artifacts. a persistent narrative file recording what was done and decided and why an ephemeral prompt telling the incoming agent how to rebuild context from the codebase and that persistent file The incoming agent reconstructs from durable project state, not just the compressed chat conversation. Persisting the file also leaves a trail, so once more than one agent is involved and you deal with multi-agent systems, you can keep track of when one is working off a summary rather than firsthand context. Easier to manage context gaps better. I opened an issue on Matt's repo with a few of these ideas: mattpocock/skills#235. How do you handle handoffs? Manual summaries, a skill, subagents? And does the two-file split resonate, or is one document enough? EDIT: In the frameworks docs I have a dedicated session explaining how handoff works there. It applies generally.. you can get ideas and apply them to Matt's skill. https://agentic-project-management.dev/docs/agent-orchestration#memory-and-project-state submitted by /u/Cobuter_Man [link] [comments]

reddit@[unknown]5/21/2026

MCP Apps Developers : Skybridge Framework v1 released 🎉

Hi Reddit, Over the last few weeks, my team and I at Alpic have been working on a complete revamp of the Skybridge framework to make it as smooth and easy to get started with as possible. As you may know, Skybridge is an open-source framework we built to help developers get started with MCP apps. It’s a thin layer on top of the official TypeScript SDK that provides the wiring and tooling needed specifically for apps. We believe that apps integrated into chats will soon play a key role in how people access information and interact with the web. With this v1 release, we’ve introduced: New DevTools with a UI designed specifically for MCP apps development An integrated tunnel that can be started with a single click directly from the DevTools Shareable chat URLs to test or showcase your MCP apps with a real LLM An audit feature to ensure your app and metadata comply with store requirements before submission (which can save a lot of time, since app reviews can be lengthy!) We also stabilized the API with a simplified design and are proud to offer strong tool-to-component type safety. It’s now also possible to deploy Skybridge outside of Alpic (the company behind Skybridge). While Alpic was designed specifically for MCP app hosting, we understand that some users may prefer hosting on different stacks for their own reasons. Hope you enjoy it! github.com/alpic-ai/skybridge submitted by /u/harijoe_ [link] [comments]

reddit@[unknown]5/20/2026

Claude Code Opus 4.7 vs Codex GPT 5.5 - strategy work - data analysis.

I'm interested in learning about how people use Claude Code Opus 4.7 for data analysis and strategic business direction, compared to Codex. Is there anyone who has had extended use of Opus 4.7 for this purpose, then moved over to GPT-5.5 on Codex? What sort of things have you noticed from a thinking, strategy, data analysis, business direction point of view? One of the main reasons I moved over to Claude from ChatGPT initially was because Claude had a far far superior strategy, reasoning, thinking, and energy about it. People are talking a lot about Codex these days, 5.5. But most are speaking purely from an app dev and design point of view. Would love to hear your thoughts. submitted by /u/Fragrant_Raisin_Face [link] [comments]

reddit@[unknown]5/19/2026

shipped early access of my Mac overlay built with Claude Code, looking for people to try it

Hello everyone. Built this because I was sending 50+ prompts a day across Claude, ChatGPT, Perplexity and re-explaining my entire project every single time I opened a fresh chat. Got tired enough of it to build a fix. It's a Mac overlay that sits on top of whichever AI tool you're in and modifies the prompt before it gets sent. Two layers under the hood: a contextual agent that classifies your query and pulls relevant chunks from your vault, and a prompt architect that rewrites your raw input into something clean and properly structured. So you type something messy and what actually reaches the model is a better version of what you meant to ask. The vault uses a GraphRAG setup so the retrieval is semantic, not just keyword matching. Built the whole thing with Claude Code over the past few months as an industrial engineering student with no Mac dev background. Weirdly meta experience using Claude Code to make Claude usage cleaner. Right now I'm focused on improving the classification and the prompt rewriting layer. It's not perfect but it works well enough that I use it every day myself. Looking for people who juggle multiple AI tools and want to try it. Early access is free at getlumia.ca. Any feedback on the architecture or how it feels to use would genuinely help. submitted by /u/r0sly_yummigo [link] [comments]

reddit@[unknown]5/19/2026

Had a close call with AI hallucinations. 6 months after shifting my workflow to Claude, here is my engineering breakdown.

Six months ago, an LLM almost cost me a major B2B client. It generated a technical answer that sounded flawless and 100% confident, but it completely messed up a decimal point on a critical equipment specification. The client was an engineer. He spotted it instantly. That was a brutal wake-up call. Since then, I stopped using AI as a casual chatbot for client-facing stuff and moved our internal workflow to Claude. Here is my honest, practical breakdown after 6 months of daily use in a technical firm. 1. It actually stops when it doesn't know Most models are trained to be "helpful" at all costs, meaning they prefer to lie and hallucinate a parameter rather than admit they lack data. Claude is different. When it hits a gap in the spec sheets I provide, it actually stops and says it can't find it in the source. In engineering compliance, a dry "I don't know" is worth infinitely more than a confident lie. 2. Context isolation using Projects Repeating your guidelines and templates in every new chat is a massive waste of time and tokens. It also leads to memory drift. I started putting our master templates, product boundaries, and strict formatting rules into Claude Projects using basic XML tags (like and ). It keeps the data isolated and ensures the model actually remembers the constraints even in long, complex sessions. 3. Prototyping tools via Artifacts We frequently need quick math tools for client presentations—things like custom ROI calculators based on our machine data. I asked Claude to build one, and it generated a working, self-contained HTML/JS file via Artifacts in about 20 minutes. No local dev environment setup needed, just straightforward logic that worked out of the box. The takeaway: For me, it wasn’t about chasing benchmark scores. It was about finding a model that can actually follow strict negative constraints (what not to do) when stakes are high. Anyone else using Claude specifically for technical auditing or compliance? How are you catching errors before they reach clients? submitted by /u/J-Freedom-AI [link] [comments]

reddit@[unknown]5/19/2026

100 Tips & Tricks for Building Your Own Personal AI Agent /LONG POST/

Everything I learned the hard way — 6 weeks, no sleep :), two environments, one agent that actually works. The Story I spent six weeks building a personal AI agent from scratch — not a chatbot wrapper, but a persistent assistant that manages tasks, tracks deals, reads emails, analyzes business data, and proactively surfaces things I'd otherwise miss. It started in the cloud (Claude Projects — shared memory files, rich context windows, custom skills). Then I migrated to Claude Code inside VS Code, which unlocked local file access, git tracking, shell hooks, and scheduled headless tasks. The migration forced us to solve problems we didn't know we had. These 100 tips are the distilled result. Most are universal to any serious agentic setup. Claude 20x max is must, start was 100%develompent s 0%real workd, after 3 weeks 50v50, now about 20v80. 🏗️ FOUNDATION & IDENTITY (1–8) 1. Write a Constitution, not a system prompt. A system prompt is a list of commands. A Constitution explains why the rules exist. When the agent hits an edge case no rule covers, it reasons from the Constitution instead of guessing. This single distinction separates agents that degrade gracefully from agents that hallucinate confidently. 2. Give your agent a name, a voice, and a role — not just a label. "Always first person. Direct. Data before emotion. No filler phrases. No trailing summaries." This eliminates hundreds of micro-decisions per session and creates consistency you can audit. Identity is the foundation everything else compounds on. 3. Separate hard rules from behavioral guidelines. Hard rules go in a dedicated section — never overridden by context. Behavioral guidelines are defaults that adapt. Mixing them makes both meaningless: the agent either treats everything as negotiable or nothing as negotiable. 4. Define your principal deeply, not just your "user." Who does this agent serve? What frustrates them? How do they make decisions? What communication style do they prefer? "Decides with data, not gut feel. Wants alternatives with scoring, not a single recommendation. Hates vague answers." This shapes every response more than any prompt engineering trick. 5. Build a Capability Map and a Component Map — separately. Capability Map: what can the agent do? (every skill, integration, automation). Component Map: how is it built? (what files exist, what connects to what). Both are necessary. Conflating them produces a document no one can use after month three. 6. Define what the agent is NOT. "Not a summarizer. Not a yes-machine. Not a search engine. Does not wait to be asked." Negative definitions are as powerful as positive ones, especially for preventing the slow drift toward generic helpfulness. 7. Build a THINK vs. DO mental model into the agent's identity. When uncertain → THINK (analyze, draft, prepare — but don't block waiting for permission). When clear → DO (execute, write, dispatch). The agent should never be frozen. Default to action at the lowest stakes level, surface the result. A paralyzed agent is useless. 8. Version your identity file in git. When behavior drifts, you need git blame on your configuration. Behavioral regressions trace directly to specific edits more often than you'd expect. Without version history, debugging identity drift is archaeology. 🧠 MEMORY SYSTEM (9–18) 9. Use flat markdown files for memory — not a database. For a personal agent, markdown files beat vector DBs. Readable, greppable, git-trackable, directly loadable by the agent. No infrastructure, no abstraction layer between you and your agent's memory. The simplest thing that works is usually the right thing. 10. Separate memory by domain, not by date. entities_people.md, entities_companies.md, entities_deals.md, hypotheses.md, task_queue.md. One file = one domain. Chronological dumps become unsearchable after week two. 11. Build a MEMORY.md index file. A single index listing every memory file with a one-line description. The agent loads the index first, pulls specific files on demand. Keeps context window usage predictable and agent lookups fast. 12. Distinguish "cache" from "source of truth" — explicitly. Your local deals.md is a cache of your CRM. The CRM is the SSOT. Mark every cache file with last_sync: header. The agent announces freshness before every analysis: "Data: CRM export from May 11, age 8 days." Silent use of stale data is how confident-but-wrong outputs happen. 13. Build a session_hot_context.md with an explicit TTL. What was in progress last session? What decisions were pending? The agent loads this at session start. After 72 hours it expires — stale hot context is worse than no hot context because the agent presents outdated state as current. 14. Build a daily_note.md as an async brain dump buffer. Drop thoughts, voice-to-text, quick ideas here throughout the day. The agent processes this during sync routines and routes items to their correct places. Structured memory without friction at ca

reddit@[unknown]5/18/2026

These 9 Building Blocks Turned Claude Code From a Chat Into a persistent OS

Most developers Claude gurus use Claude Code one project at a time. I run 18. Not 18 sessions. 18 instances of the same OS, each running a different business, all sharing one skeleton I update once and propagate everywhere. Most developers treat Claude Code as a smarter editor. That's where it all goes wrong and you get frustrated. Claude Code becomes a real operating system the moment you stop thinking of sessions as the unit of work and start thinking of the whole environment as a substrate you build on top of. Here are 9 building blocks I use. The thesis is at the bottom. Build a skeleton with selective propagation, not a project. Most developers build one project per Claude Code workspace. I built a template instead. It has plugins, rules, agents, hooks, schemas, commands. When I start a new business I clone it and the new instance inherits the entire OS. Right now I run instances for: strategy, product, marketing website, threat intelligence, three consulting clients, a personal brand layer. Each one boots with the same DNA. Each one diverges on canonical files, memory, output, and project state. None of them bleed into the others. The sync mechanism is the load-bearing part. The update CLI pushes plugins, rules, agents, hooks, schemas. It never touches memory, output, canonical, or my-project. Those are the parts of an instance that accumulate. Without selective sync you have two options: rebuild every instance on every change, or never update. Both are dead ends. If you build features into one project, you wrote a project.If you build features into a template that propagates, you wrote an OS. I'm one person operating eighteen versions of myself. Move state out of prompts and into code. LLMs are bad at remembering. Code is designed for it. Most AI workflows leak state into the prompt. Voice rules. Style preferences. Banned words. Recent decisions. Eventually you hit context limits or contradictions. I moved as much state as possible into MCP servers. Voice linter. Lead scorer. Schedule validator. Loop tracker. They run in Python, return structured data, not hallucinations. Rule of thumb: if you've explained it to Claude more than twice, it should be code. Use receipts, not status fields. This one took me the longest to figure out. Every workflow I had was claim something is done. Issue marked closed. PRD marked shipped. Test marked passing. The problem: the LLM can claim anything. I rebuilt the system around receipts. An issue can't reach verified until a script runs and writes a verification record. A PRD can't archive until every accepted finding has a receipt. A morning routine can't close without log entries from every phase. Receipts get written by code, not by the model. The model can't lie about whether code ran. Build a wiring-check gate. Half-built features rot. In a normal repo you notice because something breaks. In an AI repo nothing breaks. The half-built feature sits there and Claude pretends it works. I built a /wiring-check command. Before any task counts as done, it checks: every new skill has a trigger, every new hook lives in settings.json, every new MCP tool sits in the server, every new bus file has a producer and a consumer. "I think it works" fails the gate. "I ran X, got Y" passes. Make rules auto-load, not slash commands. If you have to type /voice to apply voice rules, voice rules will not get applied. Rules in .claude/rules/ load automatically. The voice rule fires on outbound text. The AUDHD rule fires on anything I'll act on. The social-reaction rule fires when I share someone else's post. No remembering. No willpower. Lint style in code, not in prose. I wrote a voice document once. Claude ignored half of it. Same emdashes, same filler, same hedging. I moved the banned word list into a Python scanner. Now every outbound draft hits two linters. They block emdashes, AI hype words, and 40-something other tells. The model can't talk its way past a regex. Track file dependencies with a graph. Canonical files reference each other. Change one and three others go stale. I keep a ripple-graph.json that maps these. When I edit talk-tracks, the system flags current-state and the engagement playbook for review. Chain sessions with handoffs and memory. (This is the big one) Sessions are drafts. The work is everything that survives the session: canonical files, memory, handoffs, output. If nothing persisted, you didn't work. You chatted. Every session in my system ends with /q-wrap. Writes a handoff doc, a memory update, and a status receipt. /q-morning reads all three before doing anything else. The handoff covers: what shipped, what's blocked, what's next, what I learned. Memory files hold the longer-term version. The result: I can sleep for a week, come back, and the system reminds me where I was, what I cared about, and what the next move is.Nothing about Claude Code does this by default. You build it. Cont

reddit@[unknown]5/18/2026

Is AI making us dumber?

Does anybody else feel like AI is making information access so trivial that it is in turn making us dumber? Like we don't need to go through the pain and effort of learning & remembering things as much anymore since we can just ask ChatGPT or Claude to explain it to us whenever we need it? I imagine this problem is going to cause a lot of downstream effects where a piece of background information you might've needed to know but didn't will cause you a lot of pain and suffering yet you won't even know the reason why. For example, say Claude Code writes your ORM code to display all posts and their comments. Works perfectly in dev with 10 posts. In production with 10,000 posts, it's making 10,001 database queries per page load and your database melts. Without understanding how ORM lazy loading works, you'd never spot it from reading the code, because the code looks completely innocent. This is the exact thing I worry about as people adopt AI tools more and more, and some even depend on them entirely. Anybody else have this feeling like we're just getting dumber? submitted by /u/Necessary-Course9154 [link] [comments]

reddit@[unknown]5/18/2026

Built a free Claude chat app with memory (Sonnet 4.5 is in there too)

The funny/painful timing here: I've been building this for months specifically because I wanted Sonnet 4.5 to remember everything. Then last week Anthropic pulled 4.5 from claude.ai. (I'm not a software engineer, just someone who cares a lot about AI and got obsessed with this problem and gets obsessed with things in general. Posting now because everyone seems to want sonnet back on chat and I have it.) Mneme runs on your own machine and talks to the Anthropic API directly. Because it's on the API, Sonnet 4.5 is still in the model picker. Honest catches first: The app is free. You pay Anthropic and OpenAI (for memory search) directly. Roughly $3 to $8/mo on Haiku for light use, $30 to $60 on Sonnet for moderate-highish use. No subscription. Tested mainly on Windows (one-click installer). Android browser access works over the local server/Tailscale, iPhone should work too. macOS is not packaged yet. Beta and solo dev. Things will break for someone and I'll be in the comments Setup takes about 10-20 minutes. The whole system is built non-technical people in mind, it should be relatively simple and intuitive to set up and use, and the GitHub page linked below has a PDF you can give to Claude to walk you through every step. What's actually in it (for the technically curious): There's no shortage of solid memory systems for Claude. Mneme isn't trying to win at codebase retrieval. It's a complete personal Claude client where memory is baked into the whole surface from the start, rather than added as a layer. That means: Tiered memory: Messages flow from episodic to narrative to entity summaries as relevance shifts; old context gets compressed without being lost. Daily summaries: A 7-day rolling timeline, so Claude knows what's been going on lately, not just what's semantically similar to the current message. Entity tracking: Hierarchical summaries built up over time for the people, projects, and things you keep referring to. Narrative concepts: Keyword-triggered recall for ideas you've named, surfaced when relevant. AI Notes: A persistent section Claude can write to itself between conversations. Extended thinking, file attachments, text-to-speech, a small command system (@run, artifact, etc.), autonomous python retrieval the AI can agentically use if automatic fails. Dynamic context: I wrangled with the Anthropic caching system for a while before I figured out a way to have every single message have different retrieval without breaking cache. Bon apppetit Open source (CC BY 4.0), local-first, all data in a SQLite database on your machine. It's aimed at the "journal with an AI" use case (thinking out loud, processing your week, having something that actually pays attention over time) rather than coding agents or RAG over docs. Link: Mneme-memory/MNEME-BETA: Beta version of the Claude conversational memory system Mneme (first big-ish public project, be gentle) (Video also made with Claude - shoutout to HyperFrames) (Model picker screenshot and architecture infograph in the comments if I can find a way to attach them) submitted by /u/iveroi [link] [comments]

reddit@[unknown]5/17/2026

Why doesn't Anthropic add a tree view to Claude.ai?

So I've been using Claude pretty heavily for research and deep technical discussions, and one thing that drives me absolutely insane is the lack of a conversation tree view. For those who don't know what I mean: when you edit a message or start a new branch in a conversation, Claude actually does support branching under the hood. You can go back, edit a prompt, and get a different response, which creates a fork. But the UI just... hides all of this from you. You get a flat, linear chat with little arrows to navigate between versions Meanwhile there are third party tools and visualizers that take your Claude conversation data and render it as a proper node graph, showing you every branch, every fork, every parallel thread of reasoning. It looks incredible and is actually useful for research workflows. But here's the catch: it only works as a Chrome extension, so if you're on Firefox, Safari, or mobile, you're completely out of luck. And since it's a third party tool hooking into Claude's UI, every time Anthropic pushes a frontend update it has a good chance of just breaking entirely until the extension dev gets around to patching it. That's not a stable workflow anyone should have to depend on. This exact functionality should just be in the app. https://preview.redd.it/mzqcaii80s1h1.png?width=1488&format=png&auto=webp&s=5269aa4c52d3c7569fdfdf9e12c90791d06fc6e0 submitted by /u/Ok-Owl-5740 [link] [comments]

reddit@[unknown]5/17/2026

Got tired of making sure my laptop is open for Dispatch, so gave Claude Chat full SSH access to my servers. Guess what happened

Context: I primarily code on a dev vm, instead of my laptop. I use cc on it via Termius from my iphone. I built this entirely for myself, because I got tired of switching from Claude to CC Dispatch for me is not the best solution because I don't have my laptop always open. and don't want to keep it always open. This is perfect for stuff for which u don't need the skills and plugins of claude code. Big Benefits: Use it on your phone Leverages Claude's memory Don't need any laptop/desktop This works in ChatGPT as well It has just 2 tools: list_vms run_command I don't have a plan of releasing it as a product, at least for now, because I don't think people would pass their ssh keys through my server, but if u guys want to clone it and deploy it to ur servers, i can open source it. submitted by /u/antctt [link] [comments]

Integrations

GitHub for version controlSlack for team communicationJira for project managementTrello for task organizationCircleCI for continuous integrationDocker for containerizationAWS for cloud deploymentOpenAI for natural language processing capabilitiesPostgreSQL for database managementFigma for design collaborationZapier for workflow automationVisual Studio Code for code editingHeroku for application hostingGoogle Drive for document storageNotion for knowledge management

Categories

ChatDev.aiai agentai programmingAI/MLDeveloper Tools

Repository Audit Available

Deep analysis of OpenBMB/ChatDev — architecture, costs, security, dependencies & more

View Full Audit

ChatDev Alternatives

Compare similar framework tools

All framework Tools

Browse the full category

Frequently Asked Questions

How much does ChatDev cost?▼

ChatDev uses a tiered pricing model. Visit their website for current pricing details.

What are the main features of ChatDev?▼

Key features include: 1. Clone the GitHub Repository:, 2. Set Up Python Environment:, 3. Install Dependencies:, 4. Set OpenAI API Key:, 5. Build Your Software:, 6. Run Your Software:.

What is ChatDev used for?▼

ChatDev is commonly used for: Automating software development tasks, Creating custom software solutions based on user specifications, Testing and debugging code through intelligent agents, Managing project timelines and resources effectively, Facilitating collaboration between developers and stakeholders, Generating documentation and user manuals automatically.

What does ChatDev integrate with?▼

ChatDev integrates with: GitHub for version control, Slack for team communication, Jira for project management, Trello for task organization, CircleCI for continuous integration, Docker for containerization, AWS for cloud deployment, OpenAI for natural language processing capabilities, PostgreSQL for database management, Figma for design collaboration.

Is ChatDev open source?▼