  • 2508, ByteDance, DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization
  • 2507, Anthropic, Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data
  • 2505, Google, AlphaEvolve: A coding agent for scientific and algorithmic discovery
  • 2503, Anthropic, Circuit Tracing: Revealing Computational Graphs in Language Models
  • 2503, Anthropic, Auditing language models for hidden objectives
  • 2502, DeepSeek, Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention
  • 2402, OpenAI, Sora: Video generation models as world simulators