Wiki
- 2025 12 10 EGGROLL 262
- 2025 11 27 Qwen3-VL 82
- 2025 11 22 Ouro - Scaling Latent Reasoning via Looped Language Models 114
- 2025 11 21 Code World Models 0
- 2025 11 19 Diffusion Transformers with Representation Autoencoders 123
- 2025 11 19 SAM 3D 110
- 2025 11 19 SAM 3 0
- 2025 11 08 Gemma 3 4
- 2025 11 04 Qwen Edit 418
- 2025 10 27 GEPA 0
- 2025 07 17 SmolLM3 118
- 2025 07 17 Kimi K2 485
- 2025 07 15 H-Net - Dynamic Chunking for End-to-End Hierarchical Sequence Modeling 316
- 2025 02 04 OmniHuman-1 97
- 2025 01 30 WARP 310
- 2025 01 27 Qwen2.5 VL 125
- 2025 01 23 FAST - Efficient Robot Action Tokenization 125
- 2025 01 22 GRPO 429
- 2025 01 20 ColPali & ColQwen 0
- 2025 01 20 ColBERT 0
- 2025 01 20 SPLADE 0
- 2025 01 20 DeepSeek R1 148
- 2025 01 20 DeepSeek v3 153
- 2025 01 19 GLiNER - General NER 0
- 2025 01 19 SetFit 0
- 2025 01 12 Ring Attention 447
- 2025 01 07 ReAct 202
- 2025 01 06 Movie Gen 264
- 2024 12 31 ModernBERT 548
- 2024 12 28 Multi-Head Latent Attention (MLA) 297
- 2024 12 18 C4AI Command R7B 161
- 2024 12 14 Not All Tokens Are What You Need For Pretraining 150
- 2024 12 14 You Only Cache Once (YOCO) 368
- 2024 12 12 Kolmogorov-Arnold Theorem 203
- 2024 12 09 DeltaNet 317
- 2024 12 06 InternVL 0
- 2024 12 06 Unified-IO 257
- 2024 12 05 Latent Diffusion 36
- 2024 12 05 Diffusion Transformer (DiT) 698
- 2024 12 05 MMDiT - Multi Modal Diffusion Transformer 74
- 2024 12 05 PaliGemma 752
- 2024 12 03 Stable Diffusion 3 and 3.5 476
- 2024 12 03 FLUX 486
- 2024 11 30 HNSW 0
- 2024 11 30 LO-PQ 0
- 2024 11 30 Test Time Learning (Local Learning) 198
- 2024 11 30 Gecko - Versatile Text Embeddings Distilled from Large Language Models 74
- 2024 11 30 Contextual Document Embeddings (CDE) 286
- 2024 11 30 Maximal Update Parametrization (μP) 707
- 2024 11 30 DETR 0
- 2024 11 29 Vision-Language-Action Models (VLA) 273
- 2024 11 29 Speech-to-Speech 27
- 2024 11 29 WaveNet 36
- 2024 11 29 Wav2vec 89
- 2024 11 29 Conformer 36
- 2024 11 27 LayerSkip 0
- 2024 11 27 Mixture-of-Transformers 0
- 2024 11 27 Mixture-of-Depths 0
- 2024 11 27 KV Cache Compression 139
- 2024 11 27 Token Dropping 0
- 2024 11 27 ControlNet 0