Implementing Hybrid Semantic-Lexical Search in RAG

In this article, you will learn how to implement a hybrid search strategy for RAG systems by combining BM25 lexical search with semantic search and fusing the results with Reciprocal Rank Fusion. Topics we will cover include: why hybrid search outperforms either lexical or semantic search alone in retrieval-augmented generation systems; how to implement BM25 lexical search […]
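
As a rough illustration of the fusion step named in this excerpt, here is a minimal sketch of Reciprocal Rank Fusion in plain Python. It is not taken from the article: the function name, the document IDs, and the two input rankings are hypothetical, and the constant k=60 is only the commonly used default, assumed here.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) for every list it appears in,
    with rank starting at 1. k=60 is an assumed, conventional default.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document ID.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical outputs of a BM25 retriever and a semantic retriever.
bm25_ranking = ["doc_a", "doc_b", "doc_c"]
semantic_ranking = ["doc_c", "doc_a", "doc_d"]

fused = reciprocal_rank_fusion([bm25_ranking, semantic_ranking])
print(fused)
```

Because the method uses only rank positions, the BM25 and embedding similarity scores never need to be normalized against each other.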

Introducing the Agent Toolkit for Amazon Web Services

What Is Agent Toolkit for AWS? It is an open-source project developed by AWS that helps AI coding agents work with AWS more reliably. With the newly released MCP server now part of the Toolkit, coding agents that use it can access the AWS-specific context, workflows, guardrails, and tools they need to […]

Best Authentication Platforms for AI Agents and MCP Servers in 2026

The Model Context Protocol has moved from Anthropic’s internal experiment to a de facto industry standard at a speed few integration protocols have matched. Since its launch in November 2024, MCP has grown explosively: OpenAI adopted it in March 2025, Microsoft announced support in Copilot Studio in March 2025, and by late 2025 combined Python […]

WorkOS Releases auth.md: An Open Agent Registration Protocol Built on OAuth Standards

For years, authentication on the web followed one design assumption: a human sits behind a browser. Click a button. Fill out a form. Verify an email. Copy an API key and paste it somewhere else. That model does not work when the user is delegating work to an agent. Agents are already writing code, opening […]

Build a Complete Langfuse Observability and Evaluation Pipeline for Tracing, Prompt Management, Scoring, and Experiments

    print("\nPART 5 ── Datasets & experiments --------------------------------------")
    DATASET = "capital-cities-tutorial"
    langfuse.create_dataset(name=DATASET, description="Capital-city QA benchmark")
    _items = [
        ("What is the capital of France?", "Paris"),
        ("What is the capital of Germany?", "Berlin"),
        ("What is the capital of Japan?", "Tokyo"),
        ("What is the capital of Italy?", "Rome"),
    ]
    for i, (q, a) in enumerate(_items):
        langfuse.create_dataset_item(dataset_name=DATASET, id=f"cap-{i}", input={"question": […]

StepFun Releases StepAudio 2.5 Realtime: An End-to-End Voice Model with Roleplay-Specific RLHF and Paralinguistic Comprehension

StepFun, the Shanghai-based AI lab, has released StepAudio 2.5 Realtime, an end-to-end real-time speech large language model with fully customizable persona capabilities. Unlike pipeline-based systems that separate speech recognition, reasoning, and synthesis into sequential steps, it handles all three in a single model. Audio goes […]

The Ultimate Beginners’ Guide to Building an AI Agent in Python

Introduction: "AI agents" is one of the defining terms of the decade. You see it everywhere: in job descriptions, tech company profiles, freelancers’ projects, and so on. As overwhelming as it may sound, building an AI agent is not that difficult. On the contrary, you can build a simple AI agent in a couple of minutes. This is what we will […]

Beyond the Model: Why Data Scientists Must Embrace APIs and API Documentation

1. Introduction. At the intersection of various domains (statistics, programming, AI), the ability to convey complex methodologies and insights becomes crucial. A working command of API concepts is therefore essential for effective communication within the team. First, it fosters collaboration among team members and stakeholders. Data Science (DS) projects often involve […]

Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.1% on Odysseys, Up from Base GPT-5.4’s 33.5%

Most web agents today drive a browser one action at a time. The model receives the current page state — as a screenshot or DOM text — and predicts the next click, keypress, or scroll. This action-at-a-time design made sense when language models had limited reasoning ability. As models have become more capable at writing […]

NVIDIA AI Releases Gated DeltaNet-2: A Linear Attention Layer That Decouples Erase and Write in the Delta Rule

Linear attention replaces the unbounded KV cache of softmax attention with a fixed-size recurrent state. This cuts sequence mixing to linear time and decoding to constant memory. The hard part is not what to forget. It is how to edit a compressed memory without scrambling existing associations. NVIDIA has released Gated DeltaNet-2, a linear attention […]
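
To make the idea of editing a fixed-size memory concrete, here is a small pure-Python sketch of the classic delta rule update. It is not Gated DeltaNet-2 and not NVIDIA's code: it uses a single write strength `beta` for both erasing and writing, and the function names and toy vectors are hypothetical.

```python
def read(S, q):
    """Return the value the memory matrix S (d_v x d_k) holds for query q."""
    return [sum(row[j] * q[j] for j in range(len(q))) for row in S]


def delta_rule_step(S, k, v, beta=1.0):
    """One classic delta-rule update: S <- S + beta * (v - S k) k^T.

    The memory stays the same size no matter how many pairs are written.
    """
    pred = read(S, k)  # what the memory currently returns for key k
    return [
        [row[j] + beta * (v[i] - pred[i]) * k[j] for j in range(len(k))]
        for i, row in enumerate(S)
    ]


# Toy example with two orthonormal keys (an assumption that keeps reads exact).
S = [[0.0, 0.0], [0.0, 0.0]]
S = delta_rule_step(S, k=[1.0, 0.0], v=[3.0, 4.0])
S = delta_rule_step(S, k=[0.0, 1.0], v=[5.0, 6.0])
S = delta_rule_step(S, k=[1.0, 0.0], v=[7.0, 8.0])  # overwrite the first key

print(read(S, [1.0, 0.0]))
print(read(S, [0.0, 1.0]))
```

With orthonormal keys, the third update replaces the value stored under the first key and leaves the second association untouched. Real keys overlap, which is where editing without scrambling other associations becomes hard.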