Article Using Llama3 and distilabel to build fine-tuning datasets Jun 4, 2024 • 79
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge Paper • 2505.10468 • Published May 15 • 9
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking Paper • 2505.08581 • Published May 13 • 9
Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning Paper • 2505.09738 • Published May 14 • 10
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper • 2505.10558 • Published May 15 • 16
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper • 2505.09990 • Published May 15 • 12
J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning Paper • 2505.10320 • Published May 15 • 24
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published May 15 • 120
Pleias-RAG Collection New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 28
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 6 days ago • 60
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 249
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 189
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Paper • 2410.23320 • Published Oct 30, 2024 • 8