Shawon Ashraf

shawon

https://shawonashraf.github.io

AI & ML interests

Multi-Modal ML, Learning Dynamics in Larger Models, XAI

Recent Activity

liked a model 14 days ago

BSC-LT/salamandra-7b-instruct

liked a Space 14 days ago

openeurollm/LLM-leaderboard

liked a model 14 days ago

inclusionAI/LLaDA2.0-flash

upvoted a paper 2 months ago

MixReasoning: Switching Modes to Think

Paper • 2510.06052 • Published Oct 7 • 21

upvoted an article 6 months ago

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Jun 4, 2024 • 79

upvoted a collection 6 months ago

Any-to-Any Models, Datasets, Spaces

18 items • Updated Jun 20 • 29

upvoted 8 papers 7 months ago

AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges

Paper • 2505.10468 • Published May 15 • 9

ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking

Paper • 2505.08581 • Published May 13 • 9

Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning

Paper • 2505.09738 • Published May 14 • 10

Style Customization of Text-to-Vector Generation with Image Diffusion Priors

Paper • 2505.10558 • Published May 15 • 16

Depth Anything with Any Prior

Paper • 2505.10565 • Published May 15 • 12

PointArena: Probing Multimodal Grounding Through Language-Guided Pointing

Paper • 2505.09990 • Published May 15 • 12

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published May 15 • 24

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120

upvoted an article 7 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12 • 568

upvoted 2 collections 8 months ago

Pleias-RAG

New generation of small reasoning models for RAG, search, and source summarization. • 4 items • Updated Apr 24 • 28

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 6 days ago • 60

upvoted an article 8 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024 • 364

upvoted a collection 10 months ago

SigLIP2

36 items • Updated Jul 10 • 95

upvoted a paper 10 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 249

upvoted a collection about 1 year ago

Llama 3.3

This collection hosts the transformers and original repos of Llama 3.3 • 1 item • Updated Dec 6, 2024 • 189

upvoted a paper about 1 year ago

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published Oct 30, 2024 • 8

upvoted a collection about 1 year ago

LongVU

7 items • Updated Oct 31, 2024 • 35