Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
-
Updated
Aug 16, 2024 - Python
Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
Conversational Image Recognition Chatbot
drex-062225-exp (document retrieval and extraction expert) model is a specialized fine-tuned version of docscopeocr-7b-050425-exp, optimized for document retrieval, content extraction, and analysis recognition. built on top of the qwen2.5-vl architecture.
Extract structured menu information from images into JSON using a fine-tuned E2E model or LLM.
Add a description, image, and links to the image-text-to-text topic page so that developers can more easily learn about it.
To associate your repository with the image-text-to-text topic, visit your repo's landing page and select "manage topics."