Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Michael Fromm's picture
13 5 25

Michael Fromm

mfromm
Bjornsund's profile picture prashil0202's profile picture johannhartmann's profile picture
·
https://fromm-m.github.io/fromm/
  • effi288
  • fromm-m
  • michael-fromm-a2069772

AI & ML interests

NLP, LLM, ConvAI

Recent Activity

updated a dataset 1 day ago
Eurolingua/test
published a dataset 1 day ago
Eurolingua/test
published a dataset 10 days ago
openGPT-X/leaderboard_data_ogx
View all activity

Organizations

Fraunhofer Institute for Intelligent Analysis and Information Systems's profile picture OpenGPT-X's profile picture Lamarr's profile picture Modalities's profile picture EuroLingua-GPT's profile picture EuropeanLLM-Beta's profile picture EuropeanLLM-Eval's profile picture Lamarr LLM Development's profile picture TrustLLM EU's profile picture OpenGPT-X's profile picture JupiterAI's profile picture JQL-AI's profile picture JackalFactory's profile picture Jackal-Factory's profile picture Jackal-AI's profile picture Stealth Nomis 's profile picture

upvoted a collection about 1 month ago

Nemotron-Pre-Training-Datasets

Collection
Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 18 days ago • 97
upvoted a paper 4 months ago

Tokenizer Choice For LLM Training: Negligible or Crucial?

Paper • 2310.08754 • Published Oct 12, 2023 • 3
upvoted an article 8 months ago
view article
Article

SmolLM3: smol, multilingual, long-context reasoner

  • +21
Jul 8, 2025
•
756
upvoted a paper 9 months ago

Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

Paper • 2505.22232 • Published May 28, 2025 • 18
upvoted a collection over 1 year ago

EU20-Benchmarks

Collection
Evaluation Benchmarks for 20 European languages. • 5 items • Updated Oct 11, 2024 • 9
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs