Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2407.20311

"Physics of Language Models" series

Physics of Language Models: Part 1, Context-Free Grammar

Paper • 2305.13673 • Published May 23, 2023 • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation

Paper • 2309.14402 • Published Sep 25, 2023 • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Paper • 2404.05405 • Published Apr 8, 2024 • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

Paper • 2309.14316 • Published Sep 25, 2023 • 8

Papers - Math - Generate - Synthetic Data - CoT

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

Papers - Benchmarks - Math - Reasoning - GSM-Symbolic

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Paper • 2410.05229 • Published Oct 7, 2024 • 22
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28, 2024 • 38
Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 65
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 44
Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 40

Papers - Math - GSM8K

Training Verifiers to Solve Math Word Problems

Paper • 2110.14168 • Published Oct 27, 2021 • 4
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 18
LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 40
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Paper • 2309.03883 • Published Sep 7, 2023 • 35

Papers - CoT - Arch - Reasoning - Layer Depth vs Wider Layer

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

Papers - Math - Generate Synthetic Data

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11, 2024 • 17
answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15 • 829k • 964
answerdotai/ModernBERT-large

Fill-Mask • 0.4B • Updated Jan 15 • 83.4k • 435
microsoft/phi-4

Text Generation • 15B • Updated 16 days ago • 479k • 2.2k

Papers - Math - Reasoning

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 46
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 22
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 18
Premise Order Matters in Reasoning with Large Language Models

Paper • 2402.08939 • Published Feb 14, 2024 • 28

"Physics of Language Models" series

Physics of Language Models: Part 1, Context-Free Grammar

Paper • 2305.13673 • Published May 23, 2023 • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation

Paper • 2309.14402 • Published Sep 25, 2023 • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Paper • 2404.05405 • Published Apr 8, 2024 • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

Paper • 2309.14316 • Published Sep 25, 2023 • 8

Papers - CoT - Arch - Reasoning - Layer Depth vs Wider Layer

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

Papers - Math - Generate - Synthetic Data - CoT

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

Papers - Math - Generate Synthetic Data

Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

Papers - Benchmarks - Math - Reasoning - GSM-Symbolic

GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models

Paper • 2410.05229 • Published Oct 7, 2024 • 22
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process

Paper • 2407.20311 • Published Jul 29, 2024 • 5

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11, 2024 • 17
answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15 • 829k • 964
answerdotai/ModernBERT-large

Fill-Mask • 0.4B • Updated Jan 15 • 83.4k • 435
microsoft/phi-4

Text Generation • 15B • Updated 16 days ago • 479k • 2.2k

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28, 2024 • 38
Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22, 2024 • 65
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 44
Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 40

Papers - Math - Reasoning

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 46
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Paper • 2404.02893 • Published Apr 3, 2024 • 22
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 18
Premise Order Matters in Reasoning with Large Language Models

Paper • 2402.08939 • Published Feb 14, 2024 • 28

Papers - Math - GSM8K

Training Verifiers to Solve Math Word Problems

Paper • 2110.14168 • Published Oct 27, 2021 • 4
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 18
LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 40
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Paper • 2309.03883 • Published Sep 7, 2023 • 35

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs