- Attention Is All You Need
  Paper • 1706.03762 • Published • 109
- LoRA Learns Less and Forgets Less
  Paper • 2405.09673 • Published • 91
- DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
  Paper • 2401.02954 • Published • 51
- RAFT: Adapting Language Model to Domain Specific RAG
  Paper • 2403.10131 • Published • 72
Mert Erbak PRO
merterbak
AI & ML interests
NLP and Image Processing
Recent Activity
- liked a model about 17 hours ago: rhysjones/gpt2-124M-edu-fineweb-10B
- liked a model about 20 hours ago: Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
- updated a Space about 21 hours ago: merterbak/Mistral-OCR