-
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens
score 13
入选 HF Daily Papers; HF 热度: 28 upvotes (+4); 有代码实现; 关键词(1): scaling; 顶会接收: CVPR
-
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
score 11
机构: NVIDIA; 入选 HF Daily Papers; HF 热度: 41 upvotes (+4); 关键词(6): distillation, post-training, MoE, agentic, coding
-
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding
score 10
入选 HF Daily Papers; HF 热度: 78 upvotes (+4); 有代码实现; 关键词(2): reasoning, embodied
-
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
score 10
入选 HF Daily Papers; HF 热度: 59 upvotes (+4); 有代码实现; 关键词(3): fine-tuning, pre-training, open-source
-
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model
score 10
入选 HF Daily Papers; HF 热度: 47 upvotes (+4); 有代码实现; 关键词(3): production, fine-tune, pre-training
-
FASTER: Rethinking Real-Time Flow VLAs
score 10
入选 HF Daily Papers; HF 热度: 45 upvotes (+4); 有代码实现; 关键词(3): latency, real-time, vision-language
-
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
score 10
入选 HF Daily Papers; HF 热度: 28 upvotes (+4); 有代码实现; 关键词(1): reasoning
-
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
score 10
入选 HF Daily Papers; HF 热度: 27 upvotes (+4); 有代码实现; 关键词(1): open-source
-
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer
score 9
入选 HF Daily Papers; HF 热度: 36 upvotes (+4); 有代码实现
-
Memento-Skills: Let Agents Design Agents
score 9
入选 HF Daily Papers; HF 热度: 31 upvotes (+4); 有代码实现