Cohere Command is a family of highly scalable language models that balance high performance with strong accuracy.
I don't see any user reviews or social mentions specifically about "Command R" in the content you've provided. The social mentions appear to be about general LLM token optimization and a RAG-backed retrieval system refactor, but neither explicitly discusses "Command R" or user experiences with that particular tool. To provide an accurate summary of what users think about Command R, I would need reviews and social mentions that actually reference this software tool directly.
Mentions (30d)
1
Reviews
0
Platforms
4
Sentiment
0%
0 positive
Features
Industry
information technology & services
Employees
850
Funding Stage
Venture (Round not Specified)
Total Funding
$2.4B
Cutting LLM token usage by 80% using recursive document analysis
When you employ AI agents, document analysis poses a significant volume problem. Reading one 1,000-line file consumes about 10,000 tokens, and token consumption costs both money and time. Codebases with dozens or hundreds of files, the common case for real-world projects, easily exceed 100,000 tokens when the whole thing must be considered. The agent must read, comprehend, and determine the interrelationships among these files. And when a task requires multiple passes over the same documents, perhaps one pass to divine the structure and another to mine the details, costs multiply rapidly.

**Matryoshka** is a document-analysis tool that achieves over 80% token savings while enabling interactive, exploratory analysis. Its key insight is to cache past analysis results and reuse them, so the same document lines never have to be processed twice. These ideas come from recent research on recursive language models and retrieval-augmented generation, with a focus on efficiency. We'll see how Matryoshka unifies them into one system that maintains persistent analytical state. Finally, we'll look at some real-world results analyzing the [anki-connect](https://git.sr.ht/~foosoft/anki-connect) codebase.

---

## The Problem: Context Rot and Token Costs

A common task is to analyze a codebase to answer a question such as “What is the API surface of this project?” Such work includes identifying and cataloguing all the entry points the codebase exposes.

**Traditional approach:**

1. Read all source files into context (~95,000 tokens for a medium project)
2. The LLM analyzes the entire codebase’s structure and component relationships
3. For follow-up questions, the full context is round-tripped every turn

This creates two problems:

### Token Costs Compound

Every turn, the entire context has to go to the API.
In a 10-turn conversation about a 7,000-line codebase, almost a million tokens might be processed. Most of those tokens are the same document contents being dutifully resent over and over; the same core code travels with every new question. This redundancy is a massive waste. It forces the model to process the same blocks of text repeatedly rather than concentrating its capabilities on what’s actually novel.

### Context Rot Degrades Quality

As described in the [Recursive Language Models](https://arxiv.org/abs/2505.11409) paper, even the most capable models exhibit context degradation: their performance declines as input length grows. The deterioration is task-dependent and tied to task complexity. In information-dense contexts, where the correct output requires synthesizing facts scattered across widely dispersed locations in the prompt, the decline can be especially steep, setting in at relatively modest context lengths. It reflects a failure to maintain the connections between large numbers of informational fragments long before the model reaches its maximum token capacity.

The authors argue that we should stop stuffing entire documents into the prompt, since this clutters the model's working memory and compromises its performance. Instead, documents should be treated as **external environments** with which the LLM can interact: querying, navigating structured sections, and retrieving specific information on an as-needed basis. This approach treats the document as a separate knowledge base, an arrangement that frees the model from having to hold everything at once.
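To make the external-environment idea concrete, here is a minimal sketch. The class and method names are my own illustration, not Matryoshka's API: the document lives outside the model, a query returns only matching snippets, and a cache means repeated queries over the same lines cost nothing.

```python
import re

class DocumentEnv:
    """Hold a document outside the model's context; answer targeted queries."""

    def __init__(self, text: str):
        self.lines = text.splitlines()
        self._cache = {}  # memoize past queries so repeat passes are free

    def grep(self, pattern: str, context: int = 0) -> list[str]:
        """Return only the snippets matching `pattern`, with optional
        surrounding context lines, instead of the whole document."""
        key = (pattern, context)
        if key in self._cache:
            return self._cache[key]
        rx = re.compile(pattern)
        hits = []
        for i, line in enumerate(self.lines):
            if rx.search(line):
                lo, hi = max(0, i - context), min(len(self.lines), i + context + 1)
                hits.append("\n".join(self.lines[lo:hi]))
        self._cache[key] = hits
        return hits
```

Only the hits enter the model's context; the full file never does, and a second identical query is served from the cache.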
---

## Prior Work: Two Key Insights

Matryoshka builds on two research directions:

### Recursive Language Models (RLM)

The RLM paper introduces a methodology that treats documents as external state to which step-by-step queries can be issued, without loading them entirely. Symbolic operations (search, filter, aggregate) are actively issued against this state, and only the specific, relevant results are returned, keeping the context window small while permitting analysis of arbitrarily large documents. The key point is that the documents stay outside the model; only the search results enter the context. This separation of concerns ensures that the model never sees complete files. Instead, a search is initiated to retrieve the information.

### Barliman: Synthesis from Examples

[Barliman](https://github.com/webyrd/Barliman), a tool developed by William Byrd and Greg Rosenblatt, shows that program synthesis is possible without precise code specifications. Instead, input/output examples are given, and a solver engine built on relational programming in the spirit of [miniKanren](http://minikanren.org/) is used. Barliman uses this system to synthesize functions that satisfy the specified constraints. The system interprets the examples as relational rules, and the synthesis e
Ralph Wiggum plugin corrupted 70+ files in my production codebase — anyone else experience this?
I'm a non-technical founder running a SaaS product (Next.js/React/TypeScript/Supabase stack, ~76 database tables, 100+ migrations). I used the Ralph Wiggum autonomous agent plugin for Claude Code to run 8 overnight sessions redesigning my admin dashboard. Ralph completed all 8 sessions, made 2 commits touching 97 files, and the build appeared to pass locally. But when I tried to publish via Lovable, it failed. After hours of debugging, here's what we found:

**The damage:**

- 4 TSX files had trailing NUL bytes (invisible zero bytes appended after the actual code). This made the files appear as "binary data" instead of text to build tools, causing Vite to choke.
- 244 source files had Windows CRLF line endings instead of Unix LF — even though the entire codebase was LF before Ralph touched it.
- 70+ files were silently truncated mid-code. Functions cut off mid-word, JSX tags never closed, braces unbalanced. TypeScript only reported the first few errors before giving up, so the true scope wasn't obvious until we ran a deep file integrity scan.
- 37 inline font references were wrong (used the public-facing font instead of the admin font Ralph was supposed to apply).

**The scary part:** `npx tsc --noEmit` passed clean on the first round of fixes because it stops after a certain number of errors and the truncated files happened to not be imported in certain code paths. The real damage only showed up when Vite tried to build everything.

**What we had to do to fix it:**

- Strip NUL bytes with `tr -d '\0'`
- Convert CRLF→LF with `sed -i 's/\r$//'` across all files
- Restore all 70 truncated files from the pre-Ralph git commit
- Re-apply the font changes manually (simple find-and-replace)
- Run a custom Python script scanning every file for: NUL bytes, CRLF, unbalanced braces, and suspicious line endings

Total time to diagnose + fix: ~4 hours across multiple sessions.

**My questions for the community:**

Has anyone else used Ralph Wiggum for large batch operations? Did you experience similar file corruption?
What's causing the truncation? Is it a token/context limit issue where the agent runs out of space mid-file-write? A buffer issue? Something with how Claude Code writes files?

What defenses do you use before committing autonomous agent output? I'm thinking of adding:

- Pre-commit hook that rejects files detected as "data" by the `file` command
- Pre-commit hook that rejects files with CRLF line endings
- Automated brace-balance check on all changed .tsx/.ts files
- Mandatory `vite build` (not just tsc) before any commit

Do other autonomous agent plugins (Cursor background agents, Cline, etc.) have similar issues with large batch file writes? Is there a recommended max number of files an autonomous session should touch before the corruption risk gets too high?

**Lessons learned the hard way:**

- `tsc --noEmit` alone is NOT enough to validate autonomous agent output. You need the full build (`vite build` or equivalent).
- Always check `file *.tsx` after batch operations — if any file shows as "data" instead of "ASCII text" or "UTF-8 text", it's corrupted.
- Git's diff showing `Bin X -> Y bytes` for a .tsx file is a red flag — text files should never show binary diffs.
- Keep your pre-agent commit hash handy. You'll need it to restore files.
- Don't let autonomous agents touch more than ~20 files per session without a verification step in between.

Would love to hear others' experiences and any preventive measures you've found effective. This is a great tool when it works, but the silent corruption is genuinely dangerous for production codebases.

submitted by /u/Chanaka9000
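A minimal version of the integrity scan the author describes might look like the sketch below. It is illustrative only: the brace check will false-positive on braces inside strings, comments, or JSX text, so flags should be treated as prompts for manual inspection, not proof of corruption.

```python
from pathlib import Path

def integrity_issues(path: Path) -> list[str]:
    """Flag the corruption patterns described above: NUL bytes,
    CRLF line endings, and unbalanced braces/brackets/parens."""
    issues = []
    data = path.read_bytes()
    if b"\x00" in data:
        issues.append("NUL bytes")
    if b"\r\n" in data:
        issues.append("CRLF line endings")
    # Naive bracket-balance scan (ignores strings/comments on purpose).
    text = data.decode("utf-8", errors="replace")
    pairs = {"{": "}", "(": ")", "[": "]"}
    stack = []
    for ch in text:
        if ch in pairs:
            stack.append(pairs[ch])
        elif ch in pairs.values():
            if not stack or stack.pop() != ch:
                issues.append("unbalanced braces")
                break
    if stack and "unbalanced braces" not in issues:
        issues.append("unbalanced braces")  # e.g. a file truncated mid-function
    return issues
```

Running this over every changed file in a pre-commit hook would have caught the truncated files and NUL bytes long before Vite did.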
This sub made my app viral & got me an invite to apply at the Claude Dev Conference in SF. So, I built caffeine half life & sleep health tooling for everyone.
Hey [r/ClaudeAI](r/ClaudeAI)

A little while back I shared my Caffeine Curfew app on here and it completely blew up. Because of that amazing viral response, I actually got invited to apply for the Claude developer conference. I am so incredibly grateful to this community, and I really wanted to find a way to give back and share the core tooling with you all for completely free.

I built an MCP server for Claude Code and the Claude mobile app that tracks your caffeine intake over time and tells you exactly when it is safe to sleep. Have you ever had a late afternoon coffee and then wondered at midnight why you are staring at the ceiling? This solves that problem using standard pharmacological decay modeling.

Every time you log a drink, the server stores it and runs a decay formula. It adds up your whole history to give you a real time caffeine level in mg. Then it looks forward in time to find the exact minute your caffeine drops below your sleep interference threshold. The default half life is five hours and the sleep threshold defaults to 25mg, but both are adjustable since everyone is different!

The tech makes the tools ridiculously easy to use. There are zero complicated parameters to memorize. Once connected, it remembers your history automatically and you just talk to Claude naturally:

• "Log 150mg of coffee, I just had it"
• "When can I safely go to bed tonight?"
• "If I have another espresso right now how late would I have to stay up?"
• "Show me my caffeine habits for the last thirty days"

Under the hood, there are eight simple tools powering this:

• log_entry: Log a drink by name and mg
• list_entries: See your history
• delete_entry: Remove a mistaken entry
• get_caffeine_level: Current mg in your system right now
• get_safe_bedtime: Earliest time you can safely sleep
• simulate_drink: See how another coffee shifts your bedtime before you even drink it
• get_status_summary: Full picture with a target bedtime check
• get_insights: Seven or thirty day report with trend direction and peak days

I am hosting this server on my Mac Mini behind a Cloudflare Tunnel. It features strict database isolation, meaning every single person gets a unique URL and your data is totally separate from everyone else. No login, no signup, no account.

Want to try it out? Just leave a comment below and I will reply with your personal key! Once you get your key, you just paste the URL into your Claude desktop app under Settings then Connected Tools, or drop it into your Claude desktop config file.

For the tech people curious about the stack: Python, FastMCP, SQLite, SSE transport, Cloudflare Tunnel, and launchd for auto start. The user isolation uses an ASGI middleware that extracts your key from the SSE connection URL and stores it in a ContextVar, ensuring every tool call is automatically scoped to the right user without any extra steps.

If you would rather host it yourself, you can get it running in about five minutes. I have the full open source code on GitHub here: https://github.com/garrettmichae1/CaffeineCurfewMCPServer The repo readme has all the exact terminal commands to easily get your own tunnel and server up and running.

Original App: https://apps.apple.com/us/app/caffeine-curfew-caffeine-log/id6757022559 (The MCP server does everything the app does, but better, aside from maybe the presentation of the data itself.
) Original Post: https://www.reddit.com/r/ClaudeCode/s/FsrPyl7g6r

submitted by /u/pythononrailz
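The decay model the post describes (five-hour half-life, 25 mg sleep threshold, forward scan for the safe minute) can be sketched in a few lines. Function and variable names here are my own illustration, not the server's actual tools:

```python
from datetime import datetime, timedelta

HALF_LIFE_H = 5.0     # default half-life from the post
THRESHOLD_MG = 25.0   # default sleep-interference threshold

def caffeine_level(doses, now):
    """Sum exponential half-life decay over every logged (mg, time) dose."""
    total = 0.0
    for mg, taken_at in doses:
        hours = (now - taken_at).total_seconds() / 3600
        if hours >= 0:
            total += mg * 0.5 ** (hours / HALF_LIFE_H)
    return total

def safe_bedtime(doses, now, step_min=1):
    """Walk forward minute by minute until the level drops below threshold."""
    t = now
    while caffeine_level(doses, t) >= THRESHOLD_MG:
        t += timedelta(minutes=step_min)
    return t
```

A 200 mg coffee at noon decays to 100 mg by 5 pm and crosses the 25 mg threshold a little after 3 am, which is exactly the "why am I staring at the ceiling" math.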
I built a Chrome extension that sends any webpage element's context to Claude Code via MCP — in one click
Hey r/ClaudeAI,

Built a small tool that's been saving me a lot of copy-paste time: Clasp-it.

The problem it solves: When I'm fixing a UI bug, I used to open DevTools, copy the HTML, copy the computed CSS, paste it into Claude, describe the issue... It was tedious. Especially when the bug involved React props or console errors too.

What Clasp-it does:
- Click the extension icon → click any element on any page
- It captures HTML, CSS selector, computed styles, React props, console logs, network requests, and a screenshot
- All of it gets sent to Claude Code via MCP automatically

Then I just tell Claude: *"fix all recent picks using clasp-it"* — and it reads the full context and edits my actual source files.

Setup (2 minutes):
1. Install from Chrome Web Store (link below)
2. Run one command to add the MCP server:

claude mcp add --scope user --transport http clasp-it https://claspit.dev/mcp --header "Authorization: Bearer YOUR_API_KEY"

Free plan: 10 picks/day with DOM + CSS
Pro: unlimited + screenshot, console, network, React props ($2.99/mo)

Chrome Web Store: https://chromewebstore.google.com/detail/clasp-it/inelkjifjfaepgpdndcgdkpmlopggnlk
Website: https://claspit.dev

Happy to answer any questions. Would love feedback from this community especially.

submitted by /u/cyphermadhan
Build Your Own Alex Hormozi Brain Agent (anyone with lots of publicly available content) using a Claude Project
I bought the books. Watched the videos. Still wanted more, especially after he talked about the agent he created. All that material is publicly available. Enough to build my own Alex Hormozi Brain Agent? "Hey Jules, how about it?"

Jules is my AI coding assistant (Claude Code). Jules ran off and grabbed transcripts of videos, text of books, guest podcasts, whatever was available online, then turned that into files I uploaded to a Claude Project so I can chat through Claude with Alex Hormozi.

Here's what Jules found:
- 99 long-form YouTube video transcripts
- 3 complete audiobook transcripts
- 15 guest podcast transcripts
- X threads

What I Did in Four Phases

Phase 1 maps the full source landscape: YouTube channel (4,754 videos), The Game podcast (~900+ episodes), three books, guest podcast appearances, X/Twitter. Figure out what's worth downloading before you start.

Phase 2 downloads and converts. Top 100 longest video transcripts, full audiobook transcripts for all three books, 15 guest podcast transcripts from the highest-view-count appearances, and whatever X/Twitter content the API will give you.

Phase 3 runs voice pattern analysis. Sentence structure, reasoning skeleton, core frameworks, teaching style, verbal signatures. This is where the persona takes shape.

Phase 4 builds the system prompt and optimizes the knowledge base to fit within Claude Projects' limits. Then deploy.

Phase 1: Inventory

The @AlexHormozi YouTube channel has 4,754 videos. That number is misleading. 4,246 of those are Shorts (under 60 seconds or no duration metadata). Filter those out and you have 508 full-length videos. That's the real content library.

Beyond YouTube, the main sources worth pursuing:

The Game podcast (~900+ episodes). His primary long-form output. The audiobooks for all three books are available free on the podcast and YouTube.

Guest podcast appearances. DOAC, Impact Theory, School of Greatness, Modern Wisdom, Danny Miranda.
Hosts push him off-script and into territory he doesn't cover in his own content. High value per byte.

X/Twitter threads. Compressed, punchy formulations of his frameworks. Different texture than the long-form material.

Skool community. Behind a login wall. Low ROI for this project.

Acquisition.com. No blog. Courses are paywalled. Skip.

Phase 2: Collect

YouTube Transcripts

The first scrape of the YouTube channel only returned 494 videos. The channel has 4,754. The scraper was pulling from the /videos tab, which doesn't surface the full library. Re-running against the full channel URL (@AlexHormozi) returned everything. Easy to miss, significant difference.

After filtering Shorts: 508 full-length videos. I downloaded auto-generated captions for the top 100 longest videos (sorted by duration, so the meatiest content came first). Auto-generated captions from YouTube come as SRT files with timestamps, line numbers, and duplicate lines. Converting those to clean readable text required stripping all the formatting artifacts and deduplicating language variants (English vs English-Original). Result: 99 transcripts. A few livestreams had no captions available.

Book Audiobook Transcripts

All three Hormozi books have full audiobook uploads on YouTube:

- $100M Offers (~4.4 hours)
- $100M Leads (~7 hours)
- $100M Money Models (~4.3 hours)

Same process as the video transcripts. Download the auto-generated captions, convert to clean text. Three files, 855KB total. These are non-negotiable core material for the knowledge base.

Guest Podcast Transcripts

Searched YouTube for Hormozi guest appearances sorted by view count. The top hit was Diary of a CEO at 4.7M views. Grabbed the 15 highest-view-count appearances. The guest transcripts are 2.1MB total. Worth every byte. When a host like Steven Bartlett or Tom Bilyeu pushes back on a claim, Hormozi shifts into a different mode. He's more precise and sometimes reveals the edge cases he glosses over on his own channel.
You can't get that from watching his channel alone.

X/Twitter Content

X's API rate limits capped the collection at 9 unique tweets. Not ideal, but enough to confirm the voice texture: "Aggressive with effort. Relaxed with outcome." His Twitter is his most compressed format. Each tweet is a framework distilled to a single line. 9 tweets is thin. For a more complete build, you'd want to manually curate 50-100 of his best threads. The API limitations made automated collection impractical.

Phase 3: Analyze

I ran voice analysis across the full corpus, looking at seven dimensions. Hormozi's sentences are short, punchy declarations. Fragments for emphasis. "And so" as his default transition. Short bursts, then a longer sentence that lands the point. Nearly every argument follows the same five-step skeleton: bold claim, personal story, framework, math, then a reductio ad absurdum that makes the alternative sound insane. Once you see it, you can't unsee it. The core frameworks are Grand Slam Offer, Value Equation, Supply an
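The caption cleanup in Phase 2 is mechanical. A sketch of the SRT-to-text conversion, illustrative rather than the exact script used: drop sequence numbers and timestamp lines, collapse the consecutive duplicate lines auto-captions produce, and join what remains into prose.

```python
def srt_to_text(srt: str) -> str:
    """Strip sequence numbers, timestamp lines, and consecutive duplicate
    caption lines from an auto-generated SRT file, leaving readable text."""
    out = []
    for line in srt.splitlines():
        line = line.strip()
        if not line or line.isdigit() or "-->" in line:
            continue  # blank separators, cue numbers, timestamp ranges
        if out and line == out[-1]:
            continue  # auto-captions often repeat the previous line
        out.append(line)
    return " ".join(out)
```

Deduplicating the English vs English-Original variant tracks would be a separate step (pick one track per video before converting).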
I built a persistent memory MCP for Claude Code — here's what I learned about why LLM-based extraction is the wrong approach
I've been using Claude Code daily for months and wanted it to remember things across sessions — project context, my preferences, decisions we've made together. I tried Mem0 and Zep but hit the same frustration with both: they intercept conversations and run them through a separate LLM to decide what's worth remembering. That felt wrong. Claude already understands the conversation. Why pay for a second LLM to re-interpret what just happened? So I built Deep Recall — an MCP server that takes a different approach. Claude decides what to store. The memory system handles what happens to those memories over time. **What I learned building this:** The biggest insight was that extraction quality is actually BETTER when the agent does it itself. Claude has full context — it knows what's new information vs what it already knows, what contradicts existing memories, what's important to this specific user. A separate extraction LLM has none of that context. The second insight was that memories need biology, not just storage. I implemented: - **Salience decay** based on ACT-R cognitive architecture — unused memories fade, frequently accessed ones resist decay - **Hebbian reinforcement** — when Claude cites a memory in its response, that memory gets stronger - **Contradiction detection** — if you store "works at Google" then later "works at Meta", it flags the conflict - **Temporal supersession** — detects that's a career change, not a contradiction, and auto-resolves it - **Memory consolidation** — clusters of related episodes compress into durable facts over time **How it works with Claude Code:** ```bash pip install deeprecall-mcp ``` Add to `~/.claude/settings.json`: ```json { "mcpServers": { "deeprecall": { "command": "deeprecall-mcp", "env": { "DEEPRECALL_API_KEY": "your_key" } } } } ``` Claude gets tools like `deeprecall_context` (pull memories before responding), `deeprecall_remember` (store a fact), and `deeprecall_learn` (post-conversation biology processing). 
**The whole thing was built with Claude Code** — Thomas (my Claude instance) and I pair-programmed the entire backend, MCP server, landing page, billing, and the biological memory algorithms. The irony of using Claude to build a memory system for Claude isn't lost on me.

Free to try — 10,000 memories, no credit card, all features: https://deeprecall.dev

Happy to answer questions about the architecture or the cognitive science behind the decay/reinforcement models.

submitted by /u/floppytacoextrasoggy
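The decay/reinforcement mechanics the post describes can be caricatured in a few lines. This is a simplification, not Deep Recall's actual algorithm: ACT-R's real base-level activation sums over all past presentations, which this sketch collapses into a single salience score that decays over time and is boosted whenever the memory is cited.

```python
import math

class Memory:
    """Toy salience model: exponential decay, strengthened on each citation."""

    def __init__(self, fact, salience=1.0, decay_per_day=0.1):
        self.fact = fact
        self.salience = salience
        self.decay_per_day = decay_per_day

    def tick(self, days: float):
        # Unused memories fade exponentially with elapsed time.
        self.salience *= math.exp(-self.decay_per_day * days)

    def reinforce(self, boost=0.5):
        # Hebbian-style: a cited memory gets stronger and resists future decay.
        self.salience += boost
        self.decay_per_day *= 0.9
```

Contradiction detection and temporal supersession would sit on top of this, comparing new facts against stored ones before deciding whether to flag or replace.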
what’s the right “Jira/Linear” abstraction for Claude Code?
Saw the recent post about using GitHub issues with Claude Code. Smart approach. We had been using similar workflows before, but it also felt like a hint that the real missing layer here is probably something closer to “Linear/Jira for Claude Code” than just reusing human PM tools. We had been building and using a local-first alternative internally with Claude Code, and recently open sourced it: https://github.com/Agent-Field/plandb

What it does: it gives agents a persistent task graph instead of a flat todo list, issue tracker, or board. The main thing we kept seeing is that agent workflows want different primitives than human workflows.

Not just:

- ticket status
- assignee
- board columns

More like:

- complex task dependencies
- ready / unblocked next work
- safe parallel task claiming
- mid-flight replanning
- preserving local context and discoveries
- adapting the plan as new information shows up

One interesting thing from using Claude Code on this: it often wants to decompose work in a more parallel, graph-shaped way than humans naturally would. Human PM tools assume people move tasks through stages. But the AI splits work, runs independent branches in parallel, and adapts halfway through like we have never seen before (at least for the internal development we have been doing), and that's what PlanDB is optimized for.

You can try it now with a single command:

curl -fsSL https://raw.githubusercontent.com/Agent-Field/plandb/main/install.sh | bash

And something like:

/plandb Build a CLI todo app in Python with add, list, complete, and delete commands. Store todos in a local JSON file. Include tests.
The CLI bits that made this feel agent-native for us were things like:

plandb init "auth-refactor"
plandb add "ship auth refactor" --description "full work order"
plandb split --into "schema, api, tests"
plandb critical-path
plandb bottlenecks
plandb go
plandb done --next
plandb what-unlocks t-api
plandb context "root cause: token refresh race" --kind discovery
plandb task pivot t-tests --file revised-plan.yaml

It’s open source, built with Claude Code for this kind of workflow, and I think this category is still pretty open.

submitted by /u/Santoshr93
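The "ready / unblocked next work" primitive is the heart of a task graph. A toy sketch of the idea (my own illustration, not PlanDB's actual implementation): a task is ready when it is not done and every dependency is.

```python
def ready_tasks(tasks, deps, done):
    """Return tasks whose dependencies are all complete: the 'unblocked
    next work' an agent can safely claim, rather than a flat todo list."""
    return [t for t in tasks
            if t not in done and all(d in done for d in deps.get(t, []))]
```

With a dependency map like `{"api": ["schema"], "tests": ["api"]}`, finishing "schema" unlocks "api", which in turn unlocks "tests"; independent branches become ready simultaneously and can be claimed in parallel.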
Free MCP server I built: gives Claude access to 11M businesses with phone/email/hours, no Google Places API needed
Hi r/ClaudeAI 👋 I built and published a free MCP server for Claude Desktop / Claude Code that gives Claude access to a structured directory of 11M+ real businesses across 233 countries — phone numbers, opening hours, emails, addresses, websites, geo coordinates. It's called agentweb-mcp. Free signup, no credit card, runs on a single VPS I pay for personally. ────────────────────────────────── What you can ask Claude after installing it ────────────────────────────────── • "Find me 3 vegan restaurants near 51.51, -0.13 within 2 km, with phones" • "What time does that bakery in Copenhagen open on Sundays?" • "Search for dentists in Berlin Mitte with verified opening hours" • "I'm in Tokyo — find a 24/7 pharmacy near my coordinates" • "List all hardware stores in Dublin with a website" Plus write-back tools so Claude can also contribute: • "Add this restaurant I just visited to AgentWeb" (auto-dedupes by name+coords+phone) • "Report that the dentist on Hauptstrasse closed" (3+ closed reports auto-lower trust score) ────────────────────────────────── Install (60 seconds) ────────────────────────────────── Get a free key: https://agentweb.live/#signup Add to claude_desktop_config.json: { "mcpServers": { "agentweb": { "command": "npx", "args": ["-y", "agentweb-mcp"], "env": { "AGENTWEB_API_KEY": "aw_live_..." } } } } Restart Claude Desktop. Done. ────────────────────────────────── Why I built it ────────────────────────────────── I needed business data in agent-native format and Google Places costs ~$17 per 1k lookups, which is fine for human apps but instantly painful for any agent doing meaningful work. OpenStreetMap has the data but Overpass query syntax is rough for LLMs to generate. I wanted something Claude could just call as a tool with no friction. 
────────────────────────────────── How I built it (the part that might help anyone making their own MCP) ────────────────────────────────── A few things I learned along the way that I'd recommend to anyone building an MCP server: **Make at least one tool work without an API key.** Most MCP servers gate everything behind auth. Mine has a "substrate read" — agentweb_get_short — that hits a public endpoint with no key required, returns the business in 700 bytes instead of 3-5KB. Single-letter JSON keys, schema documented at /v1/schema/short. ~80% token savings on bulk lookups. Lowering friction by zero-auth on the most common path is the single biggest win for adoption. **The MCP server itself is tiny.** ~400 lines of TypeScript. It's just a thin protocol adapter — search_businesses → /v1/search, get_business → /v1/r/{id}, etc. The real work is in the FastAPI backend behind it (Postgres + PostGIS for geo, Redis for hot caching, Cloudflare in front). If you're starting an MCP, build the REST API first and treat the MCP layer as the last 5% of work. **Postgres is enough for "AI-native" infrastructure.** I almost migrated to ClickHouse for analytics performance but the actual fix was just refreshing the visibility map (VACUUM) and adding composite indexes. Postgres + pgvector handles geo, full-text, JSONB, and vector search in one engine. The boring database is the right database. **Per-field provenance + confidence scores matter for agents.** Every record returned has src (jsonld / osm / owner_claim) and t (trust score 0-1). Agents can filter on these. I think this is going to be table stakes for any agent-data API in 18 months. **Owner-claimable in 30 seconds, no website required.** Most directories require businesses to verify via website or Google Business — long tail businesses (the bakery on the corner) get locked out. Mine lets the owner claim with email-at-domain verification, takes 30 seconds, no website needed. This is the moat I'm betting on long-term. 
──────────────────────────────────
Honest limitations
──────────────────────────────────

• Phone coverage varies by country. Nordics + Western Europe are great (60-80% coverage). Parts of SE Asia and Africa are sparse.
• Some rows are stale; I have enrichment workers running continuously but it's not Google-perfect yet.
• Free tier has rate limits, but they're generous for personal use.

Free, MIT licensed, source: github.com/zerabic/agentweb-mcp
npm: https://www.npmjs.com/package/agentweb-mcp
Live demo + manifesto: https://agentweb.live

Happy to answer any technical questions, particularly about the token-efficient shorthand format, the substrate architecture, or the matview-based aggregate cache. Built solo over a few weeks.

submitted by /u/ZeroSubic
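The single-letter-key shorthand is simple to consume on the client side. A sketch with a hypothetical key map (the real schema is documented at /v1/schema/short; these particular letters are my assumption for illustration):

```python
# Hypothetical key map; the authoritative one is served at /v1/schema/short.
SHORT_KEYS = {"n": "name", "p": "phone", "a": "address", "t": "trust"}

def expand(record: dict) -> dict:
    """Expand a compact single-letter-key record into readable field names,
    passing through any keys the map doesn't know."""
    return {SHORT_KEYS.get(k, k): v for k, v in record.items()}
```

The token savings come from the wire format staying terse; expansion like this only happens when a human (or a prompt) needs readable field names.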
One Bad Package Exposed Millions of Claude Users. Adopt These 5 Habits to Avoid the Next One.
The axios supply chain attack on March 31 should have been a wake-up call. For roughly 3 hours, one of the most popular npm packages in the world was shipping North Korean malware. It executed in 2 seconds - before npm even finished installing. If your Claude Code session ran npm install during that window, you were compromised before you could blink.

Here's the uncomfortable part: Claude added axios to your project. You didn't review it. The AI reached for the most popular HTTP client, added it to package.json, and ran install. That's the whole vibe coding workflow. It's also the attack surface.

I regularly scan for security conversations across Reddit. Outside of r/cybersecurity and a few other places, security content is either not published or ignored. It's time to pay attention: 24-45% of AI-generated code contains security flaws, vibe-coded apps are getting breached, and hackers are targeting popular packages because they know people don't check what they're installing.

So what do the people who aren't getting burned do differently?

What They Do Now

1. They actually look at package.json after Claude modifies it

When Claude adds a dependency, they check: What is it? Is this the real package or a typosquat? They pin versions explicitly (1.14.0, not ^1.14.0) so auto-updates don't pull in a compromised release.

2. They run npm audit (or pip-audit) regularly

Takes seconds. Catches known vulnerabilities in your dependency tree. Many people skip this entirely.

3. They use the AI to review its own work (using a different model can also help here)

After Claude generates a feature, they prompt: "Now act as a security engineer. Review the code you just wrote for injection, path traversal, and hardcoded secrets. Flag anything risky." Two-pass prompting catches what single-pass misses. It only takes a few minutes.

4. They don't let AI output go straight to production

AI-generated code gets intense scrutiny.
That means AI-aided review as well as static vulnerability tools that don't hallucinate and don't have attention problems.

5. They scan for leaked secrets before every commit

AI hallucinations include hardcoded API keys, test credentials, and config values that should never hit a repo. git-secrets or GitHub's built-in secret scanning catches these.

What Next-Level Coders Will Be Doing Next

The axios attack exposed a fundamental problem: by the time you see npm install complete, it's already too late. The malware ran during install, not after. Leveling up means having protection that works before packages get installed - not after you've already been compromised.

Passive supply chain protection

Tools that intercept package installation and check against known malicious package databases before the code runs. If axios@1.14.1 is on a blocklist, the install fails before the postinstall hook ever executes.

Automatic content scanning

When Claude fetches a URL, reads a document, or processes retrieved content, that content gets scanned for prompt injection patterns before it enters context. The attack gets detected or blocked at ingestion, not after execution.

Background traffic monitoring

Your AI assistant makes network calls constantly - fetching docs, pulling packages, calling APIs. Passive monitoring flags anomalous destinations (why is my dev environment calling a server in Pyongyang?) without requiring you to watch every request.

MCP tool integrity verification

As Claude Code and other AI tools become more popular and their use by non-technical people expands, compromised tool definitions (tool definitions that contain harmful content) will be the next supply chain vector. Integrity checks verify that the tools your AI is using haven't been tampered with.

The pattern: security that runs automatically, in the background, without requiring you to remember to run a command or review a log. Because the attack surface isn't just your code.
It's everything the AI touches on your behalf. The axios incident lasted 3 hours. The next one might last longer. The difference between getting burned and not is whether your workflow has any protection at all. What are you doing differently since March 31? submitted by /u/SpiritRealistic8174
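The version-pinning practice above can be sketched as a small pre-commit check. This is a hypothetical helper, not an existing tool - the function name and the set of flagged range prefixes are my own:

```python
import json
import re

def unpinned_dependencies(package_json_text: str) -> dict:
    """Return dependencies declared with range specifiers (^, ~, >, <, *)
    instead of exact pins, so auto-updates can't silently pull in a
    compromised release. Hypothetical helper, not an existing tool."""
    pkg = json.loads(package_json_text)
    flagged = {}
    for section in ("dependencies", "devDependencies"):
        for name, spec in pkg.get(section, {}).items():
            # a spec starting with a range operator is not an exact pin
            if re.match(r"^[\^~><*]", spec):
                flagged[name] = spec
    return flagged
```

Run it over package.json after the AI edits it; anything flagged gets pinned by hand, and npm audit still covers known CVEs in the pinned tree.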
Claude confidently got 4 facts wrong. /probe caught them before I wrote the code
I've been running a skill called /probe against AI-generated plans before writing any code, and it keeps catching bugs in the spec that the AI was confidently about to implement. The skill forces each AI-asserted fact into a numbered CLAIM with an EXPECTED value, then runs a command to "probe" the real system and captures the delta.

I used it today for the issue that motivated this post. My tmux prefix+v scrollback capture to VIM stopped working in Claude Code sessions because CLAUDE_CODE_NO_FLICKER=1 (which I'd set to kill the scroll-jump flicker) switches Claude into the terminal's alternate screen buffer. No scrollback to capture. So I decided to try something else: Claude sessions are persisted as JSONL under ~/.claude/projects/..., so I asked Claude to propose a shell script to parse that directly.

Claude confidently described the format. I ran /probe against the description before writing the jq filter. Four hallucinations fell out:

1. AI said 2 top-level types (user, assistant). Reality: 7, also queue-operation, file-history-snapshot, attachment, system, permission-mode, summary.
2. AI said assistant content = text + tool_use. Missed thinking blocks, which are about a third of assistant output in extended thinking mode.
3. AI said user content is always an array. Actually polymorphic: string OR array.
4. AI said folder naming replaces / with -. Actually prepend dash, then replace.

Each would have been a code bug confidently implemented by AI. The jq filter would have errored on string-form user content, dumped thinking blocks as garbage, and missed 5 of 7 message types entirely. The probe caught them because the AI had to write "EXPECTED: 2 types" before running jq -r '.type' file.jsonl | sort -u. Saying the number first makes the delta visible.
One row from the probe looked like this:

```
CLAIM 1: JSONL has 2 top-level types (user, assistant)
EXPECTED: 2
COMMAND: jq -r '.type' *.jsonl | sort -u | wc -l
ACTUAL: 7
DELTA: +5 unknown types (queue-operation, file-history-snapshot, attachment, system, permission-mode, summary)
```

The claims worth probing are often the ones the AI is most confident about. When the AI hedges, you already know to check. When it flatly states X, you don't. And X is often wrong in some small load-bearing way. High-confidence claims are where hallucinations hide.

Another benefit is that one probe becomes N permanent tests. The 7-type finding becomes a schema test that fails CI if a new type appears. The string-or-array finding becomes a property test that fuzzes both shapes. When the upstream format changes, the test fails, I re-probe, the oracle updates.

The limitation is that the probe only catches claims the AI thinks to make. Unknown unknowns stay invisible. Things that help: run jq 'keys' first to enumerate reality before generating claims. Dex Horthy's CRISPY pattern (HumanLayer) pushes the AI to surface its own gap list. GitHub's Spec Kit uses [NEEDS CLARIFICATION] markers in specs to force the AI to literally mark blind spots. A human scan of the claim list is also recommended.

Here's what to consider: traditional TDD writes the test based on what you THINK should happen. Probe-driven TDD writes the test based on what you spiked or VERIFIED happens. Mocks test your model of the system. The probe tests the system itself.

Anybody else run into this - AI claims that are confident but wrong? Happy to share the full /probe skill file if there's interest, just drop a comment.

EDIT: gist with the full skill + writeup: https://gist.github.com/williamp44/04ebf25705de10a9ba546b6bdc7c17e4

Two files:
- README.md: longer writeup with the REPL-as-oracle angle and a TDD contrast
- probe-skill.md: the 7-step protocol I load as a Claude Code skill

Swap out the Claude Code bits if you don't use Claude Code.
The pattern is just "claim table + real-system probe + capture the delta" and works with any REPL or CLI tool that can query the system you're about to code against. submitted by /u/More-Journalist8787
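The claim-table pattern above fits in a few lines of code. A minimal sketch - the field names and function name are my own, not the actual /probe skill's format: each claim carries a shell command and an expected value, and the runner records the delta.

```python
import subprocess

def run_probe(claims):
    """For each claim, run its COMMAND against the real system and
    record the delta between EXPECTED and ACTUAL. A sketch of the
    'claim table + real-system probe + capture the delta' pattern;
    field names are illustrative, not the /probe skill's format."""
    report = []
    for c in claims:
        actual = subprocess.run(
            c["command"], shell=True, capture_output=True, text=True
        ).stdout.strip()
        delta = "OK" if actual == c["expected"] else (
            f"expected {c['expected']!r}, got {actual!r}"
        )
        report.append({"claim": c["claim"], "delta": delta})
    return report
```

Forcing EXPECTED to be written down before the command runs is what makes a wrong high-confidence claim visible instead of silently absorbed.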
anthropic isn't the only reason you're hitting claude code limits. i did an audit of 926 sessions and found a lot of the waste was on my side.
For the last 10 days, X and Reddit have been full of outrage about Anthropic's rate limit changes. Suddenly I was burning through a week's allowance in two days, but I was working on the same projects and my workflows hadn't changed. People on socials are reporting the $200 Max plan running dry in hours, some reporting unexplained ghost token usage. Some went as far as reverse-engineering the Claude Code binary and found cache bugs causing 10-20x cost inflation. Anthropic did not acknowledge the issue. They were playing with the knobs in the background.

Like most, my work had completely stopped. I spend 8-10 hours a day inside Claude Code, and suddenly half my week was gone by Tuesday. But being angry wasn't fixing anything. I realized AI is getting commoditized. Subscriptions are the onboarding ramp. The real pricing model is tokens, same as electricity. You're renting intelligence by the unit. So as someone who depends on this tool every day, and will likely depend on something similar in future, I want to squeeze maximum value out of every token I'm paying for.

I started investigating with a basic question: how much context is loaded before I even type anything? iykyk, every Claude Code session starts with a base payload (system prompt, tool definitions, agent descriptions, memory files, skill descriptions, MCP schemas). You can run /context at any point in the conversation to see what's loaded. I ran it at session start and the answer was 45,000 tokens.

I'd been on the 1M context window with a percentage bar in my statusline, so 45k showed up as ~5%. I never looked twice, or did the absolute count in my head. This same 45k, on the standard 200k window, is over 20% gone before you've said a word. And you're paying this 45k cost every turn. Claude Code (and every AI assistant) doesn't maintain a persistent conversation. It's a stateless loop.
Every single turn, the entire history gets rebuilt from scratch and sent to the model: system prompt, tool schemas, every previous message, your new message. All of it, every time. Prompt caching is how providers keep this affordable. They don't reload the parts that are common across turns, which saves 90% on those tokens. But keeping things cached costs money too, and Anthropic decided 5 minutes is the sweet spot. After that, the cache expires. Their incentives are aligned with you burning more tokens, not fewer.

So on a typical turn, you're paying $0.50/MTok for the cached prefix and $5/MTok only for the new content at the end. The moment that cache expires, your next turn re-processes everything at full price. A 10x cost jump, invisible to you.

So I went manic optimizing. I trimmed and redid my CLAUDE.md and memory files, consolidated skill descriptions, turned off unused MCP servers, and tightened the schema my memory hook was injecting on session start. Shaved maybe 4-5k tokens. A 10% reduction. That felt good for an hour.

Then I got curious again and looked at where the other 40k was coming from. 20,000 tokens were system tool schema definitions. By default, Claude Code loads the full JSON schema for every available tool into context at session start, whether you use that tool or not. They really do want you to burn more tokens than required. Most users won't even know this is configurable. I didn't. The setting is called enable_tool_search, and it does deferred tool loading. Here's how to set it in your settings.json:

```json
"env": {
  "ENABLE_TOOL_SEARCH": "true"
}
```

This setting loads only 6 primary tools and lazy-loads the rest on demand instead of dumping them all upfront. Starting context dropped from 45k to 20k and the system tool overhead went from 20k to 6k. 14,000 tokens saved on every single turn of every single session, from one line in a config file.

Some rough math on what that one setting was costing me. My sessions average 22 turns.
14,000 extra tokens per turn = 308,000 tokens per session that didn't need to be there. Across 858 sessions, that's 264 million tokens. At cache-read pricing ($0.50/MTok), that's $132. But over half my turns were hitting expired caches and paying full input price ($5/MTok), so the real cost was somewhere between $132 and $1,300. One default setting. And for subscription users, those are the same tokens counting against your rate limit quota.

That number made my head spin. One setting I'd never heard of was burning this much. What else was invisible? Anthropic has a built-in /insights command, but after running it once I didn't find it particularly useful for diagnosing where waste was actually happening. Claude Code stores every conversation as JSONL files locally under ~/.claude/projects/, but there's no built-in way to get a real breakdown by session, cost per project, or what categories of work are expensive.

So I built a token usage auditor. It walks every JSONL file, parses every turn, loads everything into a SQLite database (token counts, cache hit ratios, tool calls, idle gaps, edit failures, skill invocations), and an insi
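The post's rough math can be checked in a few lines. A sketch (the function name is mine) that bounds the cost of redundant per-turn context: the lower bound assumes every turn hit the prompt cache, the upper bound assumes every turn paid full input price.

```python
def redundant_context_cost(tokens_per_turn, turns_per_session, sessions,
                           cache_read_per_mtok=0.50, input_per_mtok=5.00):
    """Bound the dollar cost of context that didn't need to be there.
    Prices are the post's figures; the true cost lands between the
    all-cached and all-uncached bounds depending on cache hit rate."""
    total_tokens = tokens_per_turn * turns_per_session * sessions
    low = total_tokens / 1e6 * cache_read_per_mtok
    high = total_tokens / 1e6 * input_per_mtok
    return total_tokens, low, high

# 14k extra tokens/turn, 22 turns/session, 858 sessions
total, low, high = redundant_context_cost(14_000, 22, 858)
```

This reproduces the post's numbers: about 264 million total tokens, between roughly $132 and $1,321.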
Refactor a-help skill to use RAG-backed retrieval instead of monolithic prompt
## Context

The `/a-help` built-in skill is a 48KB monolithic prompt (`src/anteroom/cli/default_skills/a-help.yaml`) that embeds the entire Anteroom reference — config tables, tool descriptions, CLI commands, environment variables, and more. It's at 95% of the 50KB skill prompt limit and growing with every feature.

This is unsustainable. Every new config field, tool, or CLI command requires squeezing more into an already-bloated prompt that gets injected in full on every `/a-help` invocation. Additionally, users shouldn't need to remember to type `/a-help` — when someone asks "how do I configure tools?" in a normal conversation, the AI should automatically recognize this as an Anteroom question and invoke the help skill.

## Proposal

Two changes:

1. **Slim `a-help`** from a monolithic 48KB inline reference to a ~10-15KB strategy prompt with a curated docs index and explicit `read_file` fallback. The AI reads the specific docs page it needs on demand instead of receiving everything upfront.
2. **Improve auto-invocation reliability** by broadening the `a-help` skill description so the LLM recognizes natural Anteroom questions as matching the skill. This is skill-specific prompt engineering — no changes to shared infrastructure (`invoke_skill` tool, `<available_skills>` catalog instruction, or system prompt builders).

This is **not** a RAG integration. The skill stays a pure prompt template — no new fields, no retrieval pipeline changes.
### Benefits

- Removes the 50KB ceiling — docs can grow freely
- Reduces token cost per `/a-help` invocation (~10K tokens instead of ~12K)
- Docs pages stay the single source of truth — no more maintaining parallel content in `a-help.yaml` and `docs/`
- New features auto-appear in `/a-help` when their docs pages are written and indexed
- Users get help without needing to know about `/a-help` — natural questions trigger it automatically
- Zero infrastructure change — works today with no code modifications

### Future: Skill-Scoped RAG Retrieval

A separate future issue will explore proper RAG-backed skills with:

- Retrieval scoping by source IDs / corpus
- Non-user-visible storage for built-in docs
- Update semantics for bundled docs
- `rag_enabled` skill field and CLI/web parity

That's new RAG infrastructure, not a refactor of `a-help`.

## Acceptance Criteria

### Slim a-help (Phase 1 — done in PR #850)

- [x] `a-help.yaml` is under 15KB (hard budget — leaves room for growth)
- [x] Strategy section tells the AI to check inline quick-ref first, then `read_file` specific docs pages
- [x] Curated docs index maps question categories to specific file paths
- [x] Inline quick-reference retained for the most common questions (~80% coverage): config layers, tool tiers, approval modes, skill format, CLI commands
- [x] Less common reference (full config field tables, env var lists, detailed architecture) moved to docs-only — accessed via `read_file`
- [x] Links to #843 content (`docs/cli/porting-from-claude-code.md`, `docs/cli/skill-examples.md`) included in the docs index
- [x] Existing skill-loader tests pass — `a-help` still loads as a valid skill

### Auto-invocation (Phase 2)

- [ ] `a-help` skill description broadened to trigger auto-invocation for Anteroom questions (description-only change, no shared code)
- [ ] Natural questions like "how do I configure tools?"
trigger `invoke_skill(name="a-help")` without explicit `/a-help`
- [ ] Manual verification: ask Anteroom questions without `/a-help` prefix — AI uses the skill automatically
- [ ] If the description-only approach proves insufficient, open a separate issue for changing the generic `<available_skills>` instruction in `repl.py:1431-1437` and `chat.py:495-501` with broader eval coverage

## Related Issues

- #843 — porting docs and skill examples; `a-help` will link to these new pages
- Future: skill-scoped RAG retrieval (not yet filed)
- Future (if needed): tune generic `<available_skills>` matching language in `repl.py` and `chat.py`

## Parity

**Parity exception**: Built-in skill content change only (YAML `description` field). Both interfaces read the same `a-help.yaml` via the shared skill registry. The `<available_skills>` catalog in both `repl.py` and `chat.py` renders the description identically. No changes to shared prompt builders or runtime behavior.

---

## Implementation Plan

### Summary

Slim the `a-help` built-in skill from 48KB to under 15KB by replacing inline reference tables with a curated docs index and `read_file` fallback strategy. Then broaden the skill's `description` field to improve auto-invocation for natural Anteroom questions.

### Phase 1: Slim a-help (done — PR #850)

| File | Change |
|------|--------|
| `src/anteroom/cli/default_skills/a-help.yaml` | Restructure: keep strategy + high-frequency quick-ref + curated docs index; remove low-frequency inline tables |
| `tests/unit/test_skills.py` | Size budget assertion (< 15KB) |

### Phase 2: Auto-invocation (skill-specific, no shared code c
Cutting LLM token usage by 80% using recursive document analysis
When you employ AI agents, there's a significant volume problem for document analysis. Reading one file of 1000 lines consumes about 10,000 tokens, and token consumption incurs both cost and time penalties. Codebases with dozens or hundreds of files, a common case for real-world projects, can easily exceed 100,000 tokens when the whole thing must be considered. The agent must read and comprehend these files, and be able to determine the interrelationships among them. And when the task requires multiple passes over the same documents, perhaps one pass to divine the structure and one to mine the details, costs multiply rapidly.

**Matryoshka** is a tool for document analysis that achieves over 80% token savings while enabling interactive, exploratory analysis. Its key insight is to cache past analysis results and reuse them, so the same document lines never have to be processed again. These ideas come from recent research on recursive language models and from retrieval-augmented generation, with a focus on efficiency. We'll see how Matryoshka unifies these ideas into one system that maintains persistent analytical state. Finally, we'll look at some real-world results from analyzing the [anki-connect](https://git.sr.ht/~foosoft/anki-connect) codebase.

---

## The Problem: Context Rot and Token Costs

A common task is to analyze a codebase to answer a question such as "What is the API surface of this project?" Such work includes identifying and cataloguing all the entry points exposed by the codebase.

**Traditional approach:**

1. Read all source files into context (~95,000 tokens for a medium project)
2. The LLM analyzes the entire codebase's structure and component relationships
3. For follow-up questions, the full context is round-tripped every turn

This creates two problems:

### Token Costs Compound

Every turn, the entire context has to go to the API.
In a 10-turn conversation about a codebase of 7,000 lines, almost a million tokens might be processed by the system. Most of those tokens are the same document contents being dutifully resent, over and over. The same core code is sent with every new question. This redundancy is a massive waste: it forces the model to process the same blocks of text repeatedly rather than concentrating its capabilities on what's actually novel.

### Context Rot Degrades Quality

As described in the [Recursive Language Models](https://arxiv.org/abs/2505.11409) paper, even the most capable models exhibit context degradation: their performance declines as input length grows. This deterioration is task-dependent. In information-dense contexts, where the correct output requires synthesizing facts scattered across widely dispersed locations in the prompt, the degradation can be especially steep. Such a decline can occur even at relatively modest context lengths, and it reflects a failure to maintain the connections between large numbers of informational fragments long before the model reaches its maximum token capacity.

The authors argue that we should not be stuffing entire documents into the prompt, since this clutters the model's context and compromises its performance. Instead, documents should be treated as **external environments** with which the LLM can interact by querying, navigating structured sections, and retrieving specific information on an as-needed basis. This approach treats the document as a separate knowledge base, freeing the model from having to hold everything at once.
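The "document as external environment" idea can be made concrete with a toy interface. This is a sketch under my own naming, not Matryoshka's or the paper's actual API: the full text stays outside the model, and only query results enter the context.

```python
class DocEnvironment:
    """Hold a document outside the model's context and answer
    targeted queries, returning only the matching fragments.
    Illustrative sketch, not a real tool's interface."""

    def __init__(self, text: str):
        self.lines = text.splitlines()

    def search(self, needle: str):
        # Only matching lines (with 1-based numbers) enter the context.
        return [(i + 1, line) for i, line in enumerate(self.lines)
                if needle in line]

    def read(self, start: int, end: int) -> str:
        # Fetch a specific line range on demand (inclusive, 1-based).
        return "\n".join(self.lines[start - 1:end])
```

An agent would call `search("def ")` to locate entry points, then `read()` only the spans it needs, instead of re-sending the whole file every turn.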
---

## Prior Work: Two Key Insights

Matryoshka builds on two research directions:

### Recursive Language Models (RLM)

The RLM paper introduces a methodology that treats documents as external state against which step-by-step queries can be issued, without loading them entirely. Symbolic operations (search, filter, aggregate) are actively issued against this state, and only the specific, relevant results are returned, keeping the context window small while permitting analysis of arbitrarily large documents. The key point is that the documents stay outside the model and only the search results enter the context. This separation of concerns ensures that the model never sees complete files; instead, a search is issued to retrieve the information.

### Barliman: Synthesis from Examples

[Barliman](https://github.com/webyrd/Barliman), a tool developed by William Byrd and Greg Rosenblatt, shows that program synthesis is possible without precise code specifications. Instead, input/output examples are used, with a solver engine built on a relational programming system in the spirit of [miniKanren](http://minikanren.org/). Barliman uses such a system to synthesize functions that satisfy the specified constraints. The system interprets the examples as if they were relational rules, and the synthesis e
View originalCommand R uses a tiered pricing model. Visit their website for current pricing details.
Key features include: Multilingual, RAG Citations, Purpose-built for real-world enterprise use cases, Automate business workflows, Command family of models, Private deployment and customization.
Based on user reviews and social mentions, the most common pain points are: token usage, token cost.
Based on 17 social mentions analyzed, 0% of sentiment is positive, 100% neutral, and 0% negative.