How do Large Language Models Navigate Honesty and Helpfulness?
Do We Need Zero Training Loss After Achieving Zero Training Error?
The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks
Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-Thought