Anthropic Ships Claude Opus 4.8 Alongside Dynamic Workflows and Cheaper Fast Mode, With Workflows Capped at 1,000 Subagents

Anthropic just launched Claude Opus 4.8, and two Claude Code updates shipped with it. Dynamic workflows run many subagents in parallel. Fast mode now supports Opus 4.8 at a lower price. Both are research previews. What Dynamic Workflows Actually Are: A dynamic workflow is a JavaScript script that orchestrates subagents at scale. Claude writes […]
EmoNet: Speaker-Aware Transformers for Emotion Recognition — and What I’d Build Differently in 2026

I submitted my MS thesis on Emotion Recognition in Conversation (ERC). The model, EmoNet, achieved a Weighted F1 of 39.18 on EmoryNLP, competitive with the public PapersWithCode leaderboard at the time, sitting between TUCORE-GCN_RoBERTa (39.24) and S+PAGE (39.14), and improving over my chosen baseline, CoMPM, by +1.81 F1. Two years later, I returned to look at where […]
The Infrastructure Behind Making Local LLM Agents Actually Useful

Running an LLM locally sounds straightforward. Download the weights, start the server, and send requests. That works for a chatbot, but it doesn’t automatically work for an agent. In my case, I’ve been building an agent for automated single-cell RNA-seq analysis. The idea is that, given raw data, the agent can run the full pipeline on its own, […]
Google Pay preps for AI agents with Universal Commerce Protocol

Google Pay is overhauling its payment infrastructure for an impending wave of transactions from AI agents. The latest updates introduce the Universal Commerce Protocol and a new server architecture, positioning Google Pay as a central clearinghouse for purchases executed by autonomous agents rather than human users. AI agents – designed to perform tasks like booking […]
Why AI Still Can’t Solve Your Real Mathematical Optimization Problem

If you’ve tried to use AI to build a mathematical optimization model for a real business problem, you’ve probably run into the same wall: the AI works beautifully on textbook examples and falls apart the moment you hand it your actual data and your actual problem. That gap isn’t a coincidence. It’s by design, and it’s the reason […]
DiffuJudge-AV: A Diffusion-Inspired Framework for Calibrated AV Video Evaluation

Like a Noisy Sensor. It Changed Which Autonomous-Driving Evaluator I Would Ship. There is a particular kind of result that looks impressive until you ask the wrong second question. In this project, that result was a Pearson correlation of 0.753 from a text-only Claude judge grading autonomous-driving visual-QA answers. At first glance, that looks like […]
Building a Context Pruning Pipeline for Long-Running Agents

In this article, you will learn how to implement a context pruning pipeline for long-running AI agents, enabling them to manage conversational memory efficiently through semantic similarity. Topics we will cover include: Why unbounded conversation history is a problem for agents built on top of large language models, and what a context pruning strategy looks […]
NBA plans AI system for automatic out-of-bounds calls

NBA Commissioner Adam Silver said the league plans to introduce an automated system for certain officiating decisions, including out-of-bounds calls. The system would use AI and cameras placed around the court to determine possession. Silver compared the approach to Hawk-Eye, the tracking technology used for line calls in tennis. Silver’s appearance came after a disputed […]
Perplexity AI Open-Sources Unigram Tokenizer That Achieves 5x Lower p50 Latency Than Hugging Face tokenizers Crate

Perplexity AI’s research team reimplemented their Unigram tokenizer from scratch in Rust and open-sourced the code in pplx-garden, their inference technology repository. At production input lengths, the new encoder cuts p50 latency by roughly 5x versus the Hugging Face tokenizers crate, ~2x versus SentencePiece (C++), and ~1.5x versus IREE’s tokenizer (C), with zero steady-state heap […]
A Coding Guide to Implement a pgvector-Powered Semantic, Hybrid, Sparse, and Quantized Vector Search System

In this tutorial, we build a complete pgvector playground inside Google Colab and explore how PostgreSQL can work as a powerful vector database for modern AI applications. We start by installing PostgreSQL, compiling the pgvector extension, connecting through Psycopg, and registering vector types for smooth Python integration. Then, we create embeddings with SentenceTransformers, store them […]
