AI & ML interests
Text classification, relations extraction, NER, computational biology
Recent Activity
View all activity
A joint encoder-decoder GLiNER model for a scalable open-ontology entity recognition
The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type.
GLiClass with ModernBERT backbone
-
knowledgator/gliclass-modern-large-v2.0
0.4B • Updated • 52 • 3 -
knowledgator/gliclass-modern-base-v2.0
0.2B • Updated • 400 • 2 -
knowledgator/gliclass-base-v2.0-rac-init
Zero-Shot Classification • 0.2B • Updated • 28 • 11 -
knowledgator/gliclass-modern-base-v2.0-init
Zero-Shot Classification • 0.2B • Updated • 145 • 24
Bi-encoder and poly-encoder architectures of GLiNER
Generalist and Light-weighted Models for Zero-shot Text Classification
-
GLiClass SandBox
🌖13Classify text with zero-shot classification
-
knowledgator/gliclass-large-v1.0-init
Zero-Shot Classification • 0.4B • Updated • 11 • 14 -
knowledgator/gliclass-base-v1.0-init
Zero-Shot Classification • 0.2B • Updated • 7 • 2 -
knowledgator/gliclass-small-v1.0-init
Zero-Shot Classification • 0.1B • Updated • 6 • 5
Collection of the best zero-shot text classification models. Fine-tune them with few examples using LiqFit - https://github.com/Knowledgator/LiqFit.
-
knowledgator/comprehend_it-base
Zero-Shot Classification • Updated • 2.98k • • 86 -
MoritzLaurer/deberta-v3-large-zeroshot-v1.1-all-33
Zero-Shot Classification • 0.4B • Updated • 3.09k • • 58 -
MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7
Zero-Shot Classification • 0.3B • Updated • 51.3k • • 344 -
MoritzLaurer/deberta-v3-base-zeroshot-v1.1-all-33
Zero-Shot Classification • 0.2B • Updated • 8.4k • • 30
Collection of auto-regressive models tuned for text classification
Collection of pre-trained encoder models trained on large molecules databases.
PII detection models developed in collaboration with Wordcab
-
knowledgator/gliner-pii-large-v1.0
Token Classification • Updated • 2.41k • 33 -
knowledgator/gliner-pii-base-v1.0
Token Classification • Updated • 418 • 8 -
knowledgator/gliner-pii-small-v1.0
Token Classification • Updated • 30 • 6 -
knowledgator/gliner-pii-edge-v1.0
Token Classification • Updated • 211 • 10
Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy.
-
GLiClass: Generalist Lightweight Model for Sequence Classification Tasks
Paper • 2508.07662 • Published • 9 -
knowledgator/gliclass-edge-v3.0
Text Classification • 32.7M • Updated • 83 • 17 -
knowledgator/gliclass-modern-base-v3.0
Text Classification • 0.2B • Updated • 35 • 3 -
knowledgator/gliclass-modern-large-v3.0
Text Classification • 0.4B • Updated • 30 • 14
Collection of high-quality GLiNER models tuned for working with biomedical data
-
GLiNER-biomed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition
Paper • 2504.00676 • Published • 5 -
Ihor/gliner-biomed-large-v1.0
Token Classification • Updated • 104 • 12 -
Ihor/gliner-biomed-base-v1.0
Token Classification • Updated • 94 • 4 -
Ihor/gliner-biomed-small-v1.0
Token Classification • Updated • 25 • 3
GLiNER models based on modern encoder architectures
Collection of initial models and models that use converted decoders to encoders as backbones
-
knowledgator/Qwen-encoder-0.5B
Question Answering • 0.5B • Updated • 10 • 9 -
knowledgator/Llama-encoder-1.0B
Question Answering • 1B • Updated • 387 • 3 -
knowledgator/Sheared-LLaMA-encoder-1.3B
Question Answering • 1B • Updated • 3 • 2 -
knowledgator/Qwen-encoder-1.5B
Question Answering • 2B • Updated • 8 • 2
Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks.
Knowledgator GLiNER models for information extraction
-
knowledgator/gliner-multitask-v1.0
Token Classification • Updated • 3.67k • 35 -
knowledgator/gliner-multitask-large-v0.5
Token Classification • Updated • 1.33k • 137 -
GLiNER HandyLab
⚡83Explore multiple text analysis tasks with GLiNER HandyLab
-
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks
Paper • 2406.12925 • Published • 25
Collection of models for converting chemical formats between each other.
-
knowledgator/SMILES2IUPAC-canonical-small
Text Generation • 5.97M • Updated • 349 • 7 -
knowledgator/IUPAC2SMILES-canonical-base
Text Generation • Updated • 65 • 6 -
knowledgator/IUPAC2SMILES-canonical-small
Text Generation • 5.79M • Updated • 5 • 5 -
knowledgator/SMILES2IUPAC-canonical-base
Text Generation • Updated • 10.1k • 9
Collection of datasest for various information extraction tasks.
PII detection models developed in collaboration with Wordcab
-
knowledgator/gliner-pii-large-v1.0
Token Classification • Updated • 2.41k • 33 -
knowledgator/gliner-pii-base-v1.0
Token Classification • Updated • 418 • 8 -
knowledgator/gliner-pii-small-v1.0
Token Classification • Updated • 30 • 6 -
knowledgator/gliner-pii-edge-v1.0
Token Classification • Updated • 211 • 10
A joint encoder-decoder GLiNER model for a scalable open-ontology entity recognition
Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy.
-
GLiClass: Generalist Lightweight Model for Sequence Classification Tasks
Paper • 2508.07662 • Published • 9 -
knowledgator/gliclass-edge-v3.0
Text Classification • 32.7M • Updated • 83 • 17 -
knowledgator/gliclass-modern-base-v3.0
Text Classification • 0.2B • Updated • 35 • 3 -
knowledgator/gliclass-modern-large-v3.0
Text Classification • 0.4B • Updated • 30 • 14
The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type.
Collection of high-quality GLiNER models tuned for working with biomedical data
-
GLiNER-biomed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition
Paper • 2504.00676 • Published • 5 -
Ihor/gliner-biomed-large-v1.0
Token Classification • Updated • 104 • 12 -
Ihor/gliner-biomed-base-v1.0
Token Classification • Updated • 94 • 4 -
Ihor/gliner-biomed-small-v1.0
Token Classification • Updated • 25 • 3
GLiClass with ModernBERT backbone
-
knowledgator/gliclass-modern-large-v2.0
0.4B • Updated • 52 • 3 -
knowledgator/gliclass-modern-base-v2.0
0.2B • Updated • 400 • 2 -
knowledgator/gliclass-base-v2.0-rac-init
Zero-Shot Classification • 0.2B • Updated • 28 • 11 -
knowledgator/gliclass-modern-base-v2.0-init
Zero-Shot Classification • 0.2B • Updated • 145 • 24
GLiNER models based on modern encoder architectures
Bi-encoder and poly-encoder architectures of GLiNER
Collection of initial models and models that use converted decoders to encoders as backbones
-
knowledgator/Qwen-encoder-0.5B
Question Answering • 0.5B • Updated • 10 • 9 -
knowledgator/Llama-encoder-1.0B
Question Answering • 1B • Updated • 387 • 3 -
knowledgator/Sheared-LLaMA-encoder-1.3B
Question Answering • 1B • Updated • 3 • 2 -
knowledgator/Qwen-encoder-1.5B
Question Answering • 2B • Updated • 8 • 2
Generalist and Light-weighted Models for Zero-shot Text Classification
-
GLiClass SandBox
🌖13Classify text with zero-shot classification
-
knowledgator/gliclass-large-v1.0-init
Zero-Shot Classification • 0.4B • Updated • 11 • 14 -
knowledgator/gliclass-base-v1.0-init
Zero-Shot Classification • 0.2B • Updated • 7 • 2 -
knowledgator/gliclass-small-v1.0-init
Zero-Shot Classification • 0.1B • Updated • 6 • 5
Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks.
Collection of the best zero-shot text classification models. Fine-tune them with few examples using LiqFit - https://github.com/Knowledgator/LiqFit.
-
knowledgator/comprehend_it-base
Zero-Shot Classification • Updated • 2.98k • • 86 -
MoritzLaurer/deberta-v3-large-zeroshot-v1.1-all-33
Zero-Shot Classification • 0.4B • Updated • 3.09k • • 58 -
MoritzLaurer/mDeBERTa-v3-base-xnli-multilingual-nli-2mil7
Zero-Shot Classification • 0.3B • Updated • 51.3k • • 344 -
MoritzLaurer/deberta-v3-base-zeroshot-v1.1-all-33
Zero-Shot Classification • 0.2B • Updated • 8.4k • • 30
Knowledgator GLiNER models for information extraction
-
knowledgator/gliner-multitask-v1.0
Token Classification • Updated • 3.67k • 35 -
knowledgator/gliner-multitask-large-v0.5
Token Classification • Updated • 1.33k • 137 -
GLiNER HandyLab
⚡83Explore multiple text analysis tasks with GLiNER HandyLab
-
GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks
Paper • 2406.12925 • Published • 25
Collection of auto-regressive models tuned for text classification
Collection of models for converting chemical formats between each other.
-
knowledgator/SMILES2IUPAC-canonical-small
Text Generation • 5.97M • Updated • 349 • 7 -
knowledgator/IUPAC2SMILES-canonical-base
Text Generation • Updated • 65 • 6 -
knowledgator/IUPAC2SMILES-canonical-small
Text Generation • 5.79M • Updated • 5 • 5 -
knowledgator/SMILES2IUPAC-canonical-base
Text Generation • Updated • 10.1k • 9
Collection of pre-trained encoder models trained on large molecules databases.
Collection of datasest for various information extraction tasks.