Manohar Allu
manohar03
AI & ML interests
None yet
Recent Activity
updated
a collection
about 1 month ago
Audio
liked
a model
about 2 months ago
tencent/HY-MT1.5-1.8B
updated
a collection
about 2 months ago
Mobile Testing
Organizations
Text-to-video
merged
ROBOTICS
General Purpose
VLM
-
Runtime errorFeatured454
OmniParser V2
🏢454OmniParser, turn your LLM into GUI agent
-
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 44 -
microsoft/Magma-8B
Robotics • Updated • 430 • 414 -
mlfoundations/Gelato-30B-A3B
Image-Text-to-Text • Updated • 164 • 31
Mobile Testing
tech
Text-to-video
Embed
merged
Medical
ROBOTICS
text-to-image
General Purpose
Audio
VLM
-
Runtime errorFeatured454
OmniParser V2
🏢454OmniParser, turn your LLM into GUI agent
-
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 44 -
microsoft/Magma-8B
Robotics • Updated • 430 • 414 -
mlfoundations/Gelato-30B-A3B
Image-Text-to-Text • Updated • 164 • 31