arXiv cs.AI

Steering LLM Personality via Latent Feature Interventions

Researchers have developed a mechanistic method to steer LLM personality traits by identifying and modifying latent features in the model's residual stream using sparse autoencoders, enabling precise behavioral control without retraining.

arXiv cs.AI

MedEvoEval: A Longitudinal Framework for Evaluating Doctor Agents

MedEvoEval is a new evaluation framework that moves beyond static medical QA by testing how doctor agents learn, retain, and adapt clinical decision-making skills across sequences of simulated outpatient episodes.

Specialized Clinical AI Outperforms General Models in Real-World Use

A study of 620 real-world clinical queries shows that specialized AI tools significantly outperform general-purpose models across accuracy, utility, and verifiability, highlighting the need for domain-specific evaluation.

HyphaeDB: Moving From Passive Storage to Agent-Native Memory

HyphaeDB reinterprets HNSW graph topology as a communication fabric for multi-agent systems, enabling knowledge propagation and emergent consensus rather than just passive retrieval.

ComMem: Dual-Memory Systems for VLM Test-Time Adaptation

ComMem improves VLM robustness by mimicking biological memory, using a fast-adapting visual cache and a slow-integrating textual prototype system to maintain cross-modal consistency during test-time adaptation.

Agentic Abstention: Improving When LLM Agents Should Stop

LLM agents often fail to stop when a task is impossible, leading to unnecessary tool use. The CONVOLVE method improves timely abstention by distilling interaction trajectories into reusable stopping rules.

Agent Safety Is Action Alignment, Not Content Refusal

Treating agent safety like chatbot content moderation is a category error. True agent security requires enforcing least privilege at the action boundary, not training models to refuse requests.

Making LLM Self-Evolution Safe with Held-Out Selection

RSEA improves LLM agent performance by recursively evolving natural-language artifacts while using a strict held-out validation gate to prevent performance regression.

Stabilizing Critic-Free RL with BV-Blend

BV-Blend improves reinforcement learning stability by blending prompt-local statistics with historical cluster-based moments, preventing training stalls when reward variance is zero.

IMCBench: Evaluating Multimodal LLMs in Clinical Conversations

IMCBench is a new multi-turn, image-grounded benchmark for medical AI that reveals a critical gap: accurate clinical descriptions do not guarantee safe patient guidance.

ATHENA-R1: An AI Agent for Iterative Biomedical Treatment Reasoning

ATHENA-R1 is an AI agent that performs iterative treatment reasoning by dynamically querying a universe of 212 biomedical tools, outperforming GPT-5 by significant margins in clinical benchmarks.

Closing the Loop Between Model Evaluation and Data Intervention

By introducing 'capability slices'—groups of evaluation samples categorized by task and operation—engineers can transform benchmark failures into precise, actionable data interventions rather than relying on intuition.

GPTNT: A Real-Time Collaborative Benchmark for AI Agents

GPTNT uses the game 'Keep Talking and Nobody Explodes' to test AI agent collaboration under time pressure, revealing critical failures in state tracking and real-time communication.

arXiv cs.AIMLOps & InfrastructureJun 29, 2026

COMPASS: Improving Compositional Control in Multimodal Models

COMPASS introduces a unified framework that uses a shared 'expert token' to bridge composition perception and generation, enabling precise layout control in multimodal models.

DAY 02Yesterday JUN 29 · 202616 SUMMARIES

Scaling Item Knowledge with JD's Oxygen AIIC Platform

JD.com's Oxygen AIIC uses a hybrid LLM/VLM architecture to automate item-knowledge production at scale, achieving 94.2% precision and 82.8% recall across tens of billions of SKUs.

arXiv cs.AI

Agent-Native Immune System (ANIS): Architecture for Runtime Defense

The Agent-Native Immune System (ANIS) shifts AI security from static training-time alignment to dynamic, runtime defense, using a six-layer 'Immune Tower' to protect autonomous agents against memory poisoning and tool-chain manipulation.

ATOD: Hybrid Distillation for Autonomous Agent Training

ATOD combines on-policy distillation with reinforcement learning using an annealed schedule and turn-level reweighting to train small agent models that outperform their larger teacher models.

arXiv cs.AIRAG & RetrievalJun 29, 2026

Odyssey: A Categorical Framework for Verifiable Foundation Models

Odyssey uses categorical sheaf theory to compose modular 'foundries'—verifiable, truth-preserving architectural components—that allow for structured, queryable, and auditable LLM-based systems.

DysLexLens: Analyzing Dyslexic AI User Experiences via LLMs

DysLexLens is an end-to-end framework that extracts, structures, and validates insights from noisy online forum data to understand how dyslexic learners interact with AI tools.

ToE: Hierarchical Claim Verification Against Adversarial Misinformation

Tree of Evidence (ToE) is a fact-checking framework that uses a reinforcement learning-driven agent to decompose claims into hierarchical argument trees, significantly improving verification accuracy against adversarially poisoned inputs.

Improving Long-Horizon LLM Planning via Symbolic Feedback

This framework enhances LLM planning reliability by using a symbolic verifier to identify errors and provide corrective, interpretable instructions for iterative self-refinement.

AI-ModelNet: A Networked Architecture for Collaborative AI

AI-ModelNet proposes a hierarchical, Internet-inspired architecture to enable interconnection and collaborative reasoning among heterogeneous, domain-specific models, addressing the fragmentation of the current AI landscape.

Personality Prompting in Multi-Agent Teams: Task-Dependent Impact

Personality manipulation in LLM agents significantly alters communication style but only degrades task performance in open-ended or collaborative domains, while remaining largely neutral in structured coding tasks.

arXiv cs.AIAI AutomationJun 29, 2026

Tandem Reinforcement Learning: Aligning AI Reasoning with Humans

Tandem Reinforcement Learning (TRL) forces stronger models to co-generate reasoning with weaker models, resulting in more legible, robust, and human-compatible chains of thought without sacrificing performance.

Scaling E-commerce Item Knowledge with LLM-Centric Architectures

JD.com's Oxygen AIIC platform uses a 'Semantic Search then Discrimination' architecture and human-AI collaboration to manage tens of billions of SKUs, achieving 94.2% precision in automated item knowledge production.

Architecting an Agent-Native Immune System (ANIS) for AI Security

The Agent-Native Immune System (ANIS) moves security from external training-time alignment to an endogenous, runtime defense architecture that protects autonomous agents from hijacking and manipulation.

ATOD: Hybrid Training for High-Performance AI Agents

ATOD combines on-policy distillation with reinforcement learning to overcome the performance ceiling of imitation learning, using an annealed schedule and turn-level reweighting to improve long-horizon agent training.

Tree of Evidence: Hierarchical Fact-Checking Against AI Misinformation

ToE (Tree of Evidence) is a hierarchical framework that combats AI-generated misinformation by decomposing claims into dynamic argument trees, using reinforcement learning to retrieve and verify evidence across multiple sources.

Mitigating Rollout Error in Graph World Models

Graph World Models (GWMs) face unique long-horizon errors where local inaccuracies propagate through topology. The Error-Aware GWM framework uses spectral regularization and critical-node weighting to maintain stability during dynamic-edge rollouts.