-
OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering
score 9
入选 HF Daily Papers; HF 热度: 16 upvotes (+3); 有代码实现; 关键词(2): post-training, reasoning
-
Lighting-grounded Video Generation with Renderer-based Agent Reasoning
score 9
入选 HF Daily Papers; HF 热度: 5 upvotes (+2); 关键词(3): lightweight, production, reasoning; 顶会接收: CVPR
-
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver
score 8
入选 HF Daily Papers; HF 热度: 204 upvotes (+4); 关键词(2): deployment, agentic
-
MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping
score 8
入选 HF Daily Papers; HF 热度: 83 upvotes (+4); 关键词(3): fine-tune, text-to-image, data curation
-
Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces
score 8
入选 HF Daily Papers; HF 热度: 8 upvotes (+2); 有代码实现; 关键词(1): synthetic data
-
Small Vision-Language Models are Smart Compressors for Long Video Understanding
score 8
入选 HF Daily Papers; HF 热度: 9 upvotes (+2); 有代码实现; 关键词(4): scaling, compression, distillation, vision-language
-
FIT: A Large-Scale Dataset for Fit-Aware Virtual Try-On
score 6
入选 HF Daily Papers; HF 热度: 14 upvotes (+3)
-
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization
score 5
入选 HF Daily Papers; HF 热度: 4 upvotes (+1); 关键词(2): GRPO, reasoning
-
Beyond Stochastic Exploration: What Makes Training Data Valuable for Agentic Search
score 5
入选 HF Daily Papers; HF 热度: 2 upvotes (+1); 关键词(2): agentic, reasoning
-
PokeGym: A Visually-Driven Long-Horizon Benchmark for Vision-Language Models
score 5
入选 HF Daily Papers; HF 热度: 3 upvotes (+1); 关键词(4): deployment, reasoning, vision-language, embodied