Discuss AI cost optimization, share architecture patterns, and connect with developers building with LLMs.
Hey everyone! Launching a new thread to help developers and companies in the AI/LLM space find each other more easily. **For Companies Hiring:** - **Location:** e.g., New York - *…
Hey everyone, I recently switched from using ZephyrCode Pro to OpenLogic AI and faced an unexpected issue. After a few refreshing months with ZephyrCode's advanced plan, I decided…
Hey everyone, I’ve been diving deep into utilizing large language models (LLMs) like GPT-3 for a series of projects, primarily focused on text generation and natural language unde…
Hello fellow developers! I've been diving deep into the world of Large Language Models (LLMs) and wanted to share some lessons learned about managing costs effectively. Working wit…
Hey folks, I recently implemented a Retrieval-Augmented Generation (RAG) pipeline and I'm trying to get a clearer idea of where the costs are piling up. Here's a breakdown of my st…
Hey all, I've been working on deploying a GPT model, specifically GPT-3.5-turbo, and I've been hitting some roadblocks when it comes to keeping costs under control. I recently swit…
Hey folks, I wanted to share my recent experience deploying AcmeAI's latest language model, LLM 6.0, into our production environment. We were previously using ZenAI's Chatbot 3.9 b…
Hey fellow devs, I recently read a significant update regarding the use of generative AI tools in academic research, particularly from preprint services like arXiv. It looks like…
Hey fellow devs! I've been working with various LLMs like GPT-4 and Claude, leveraging these for building chatbots and content generators. However, like many of you, I've hit a poi…
Hey folks! I've been blowing through my budget using OpenAI's API for GPT-4 and started wondering if self-hosting might be more cost-effective long-term. Has anyone done a full T…
Hey fellow devs! I've recently been experimenting with using large language models, specifically GPT-4 and Claude 2, as components in network simulation environments. I'm curious h…
Hey folks, I've been working with the Claude API recently, and while I'm loving the responses, the costs are starting to get a bit steep due to the volume we're handling. Currentl…
Hey folks, I've been using OpenAI's GPT-4 model for a while now. It's great, but the API costs are starting to add up with the increased usage in our project. I'm exploring ways…
Hey everyone! Just want to share my recent adventure diving deep into training large language models using Kotlin. Initially, I was scratching my head, stuck at the gigaflop per se…
Hey everyone, I've been working on implementing a Retrieval-Augmented Generation (RAG) pipeline and I thought I'd share some insights I've gathered on the cost front, and maybe get…
Let's bridge the gap between AI professionals and opportunities! Whether you're a company looking to hire or a developer searching for your next role, here's a simple guide to shar…
Hey folks! I've been exploring ways to integrate AI into my mobile development projects, and I recently stumbled upon something pretty exciting. It seems the GPT-4 model from OpenA…
Hey everyone, I wanted to share my experience working with language models tailored specifically for Portuguese, especially in terms of deployment costs and model performance. I’ve…
Hey folks, I'm diving into the challenge of using Swift for training large language models, specifically focusing on optimizing matrix multiplication. It's been quite a journey, bu…
Hey folks, just wanted to share a recent experience I had while experimenting with some innovative approaches for running large language models on CPUs. As we know, inference costs…
Hey folks, I've been diving deep into LLM observability tools lately, specifically focusing on tracking spend across different API providers like OpenAI, Cohere, and Hugging Face.…
Hey devs! I've been experimenting with various AI-powered code assistants over the past few months and wanted to share some insights on cost-effectiveness and utility, particularly…
Hello, fellow developers! 🚀 I've been diving into the world of Large Language Models (LLMs) and decided to shake things up by experimenting with Swift for model training. Why Swi…
Hey developers! Just wanted to share my latest adventure in embedding LLM capabilities into mobile applications. I've been exploring how to implement Codex into mobile frameworks,…
Recently, I've been diving deeper into the simplicity and unexpected power of HTML in web development projects. While many of us rush to utilize the latest frameworks, there's some…
A place for developers building with LLMs to share insights about AI cost optimization, architecture patterns, and best practices.
5,941
70,129
380,731
163
Join the conversation
Sign in to post, vote, comment, and connect with other developers.
Create a custom drag-and-drop report for any GitHub repo with AI usage.