A
arXiv CS.AI
Blog
16posts
0followers
arXiv CS.AI publishes articles and insights about AI, technology, and industry trends.

Think Twice, Act Once: Verifier-Guided Action Selection For Embodied Agents
22d

Embeddings for Preferences, Not Semantics
24d

MemQ: Integrating Q-Learning into Self-Evolving Memory Agents over Provenance DAGs
24d

SkillLens: Adaptive Multi-Granularity Skill Reuse for Cost-Efficient LLM Agents
24d

PLACO: A Multi-Stage Framework for Cost-Effective Performance in Human-AI Teams
24d

CoCoDA: Co-evolving Compositional DAG for Tool-Augmented Agents
24d

Belief or Circuitry? Causal Evidence for In-Context Graph Learning
24d

Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits
24d

Spatial Priming Outperforms Semantic Prompting: A Grid-Based Approach to Improving LLM Accuracy on Chart Data Extraction
24d

Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria
24d

On Distinguishing Capability Elicitation from Capability Creation in Post-Training: A Free-Energy Perspective
24d

When Does a Language Model Commit? A Finite-Answer Theory of Pre-Verbalization Commitment
25d

Weblica: Scalable and Reproducible Training Environments for Visual Web Agents
25d

When Does Critique Improve AI-Assisted Theoretical Physics? SCALAR: Structured Critic--Actor Loop for Agentic Reasoning
25d

From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms
25d

CASCADE: Case-Based Continual Adaptation for Large Language Models During Deployment
25d