1 326 222

Gibran Iqbal PRO

Jibbscript

AI & ML interests

None yet

Recent Activity

liked a model about 14 hours ago

Roblox/roblox-pii-classifier

upvoted a paper about 14 hours ago

Linear representations in language models can change dramatically over a conversation

upvoted a paper about 14 hours ago

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

View all activity

Organizations

liked a model about 14 hours ago

Roblox/roblox-pii-classifier

Text Classification • 0.6B • Updated 5 days ago • 868 • 26

upvoted 4 papers about 14 hours ago

Linear representations in language models can change dramatically over a conversation

Paper • 2601.20834 • Published 2 days ago • 17

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 3 days ago • 19

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published 4 days ago • 67

Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Paper • 2601.20614 • Published 2 days ago • 110

liked a model 1 day ago

vidore/colpali-v1.3

Visual Document Retrieval • Updated Mar 14, 2025 • 33k • 87

upvoted 6 papers 2 days ago

AVMeme Exam: A Multimodal Multilingual Multicultural Benchmark for LLMs' Contextual and Cultural Knowledge and Thinking

Paper • 2601.17645 • Published 6 days ago • 22

FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning

Paper • 2601.18116 • Published 5 days ago • 11

Revisiting Parameter Server in LLM Post-Training

Paper • 2601.19362 • Published 4 days ago • 6

upvoted an article 3 days ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

4 days ago

•

upvoted 7 papers 3 days ago

Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow

Paper • 2601.14243 • Published 10 days ago • 19

Learning to Discover at Test Time

Paper • 2601.16175 • Published 8 days ago • 40

AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation

Paper • 2601.17761 • Published 6 days ago • 13

CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval

Paper • 2601.15849 • Published 9 days ago • 14

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Paper • 2601.18137 • Published 5 days ago • 22

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 4 days ago • 36

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published 4 days ago • 120

Gibran Iqbal PRO

AI & ML interests

Recent Activity

Organizations

Jibbscript's activity

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective