ai_ocr / README.md
utarn's picture
Update README.md
162b367 verified
---
title: Omni API Gradio UI
emoji: 🤖
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.46.1
app_file: app.py
pinned: false
---
# Omni API Gradio UI
A Gradio-based user interface for the Omni API that supports text, PDF, image, and audio file processing.
## Features
- Text input for chat messages
- Multiple file upload support (PDF, images, audio)
- Configurable API base URL
- Real-time response display
- File ordering for multi-modal requests
## Installation
```bash
# Install dependencies
uv sync
# Run the application
uv run python app.py
```
### Development Mode (with auto-reload)
For development, you can use the auto-reload feature that will automatically restart the app when files change:
```bash
uv run python dev.py
```
This will monitor for changes in Python files, Markdown files, and TOML configuration files, automatically restarting the Gradio app when any of these files are modified.
## Usage
1. Configure the API base URL (defaults to https://api-omni.modelharbor.com)
2. Enter your text message
3. Upload files in the desired order (optional)
4. Click "Send Request" to interact with the API
5. View the response in the right panel
## Supported File Types
- **PDFs**: Document processing
- **Images**: JPG, PNG, GIF, BMP, WEBP
- **Audio**: MP3, WAV, M4A, FLAC, OGG
---
tags:
- gradio
- omni-api
- multimodal
- chat-interface
- pdf-processing
- image-processing
- audio-processing
- llm
- api-client