MASCing: Configurable Mixture-of-Experts Behavior via Activation Steering Masks Paper • 2604.27818 • Published Apr 30 • 5
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published Apr 13 • 7
DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Paper • 2604.13902 • Published Apr 15 • 62