Zerox
OCR & Document Extraction using vision models. Contribute to getomni-ai/zerox development by creating an account on GitHub.
A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense! Zerox is available as both a Node and Python package. (Node.js SDK - supports vision models from different providers like OpenAI, Azure OpenAI, Anthropic, AWS Bedrock, Google Gemini, etc.) The maintainFormat option tries to return the markdown in a consistent format by passing the output of a prior page in as additional context for the next page. This requires the requests to run synchronously, so it's a lot slower. But valuable if your documents have a lot of tabular data, or frequently have tables that cross pages. Zerox supports structured data extraction from documents using a schema. This allows you to pull specific information from documents in a structured format instead of getting the full markdown conversion. Use extractPerPage to extract data per page instead of from the whole document at once. Zerox supports a wide range of models across different providers: (Python SDK - supports vision models from different providers like OpenAI, Azure OpenAI, Anthropic, AWS Bedrock, etc.) The pyzerox.zerox function is an asynchronous API that performs OCR (Optical Character Recognition) to markdown using vision models. It processes PDF files and converts them into markdown format. Make sure to set up the environment variables for the model and the model provider before using this API. Refer to the LiteLLM Documentation for setting up the environment and passing the correct model name. Note the output is manually wrapped for this documentation for better readability. This project is licensed under the MIT License. OCR Document Extraction using vision models There was an error while loading. Please reload this page. There was an error while loading. Please reload this page. There was an error while loading. Please reload this page. There was an error while loading. Please reload this page.
Prisma
Prisma is a next-generation Node.js and TypeScript ORM for PostgreSQL, MySQL, SQL Server, SQLite, MongoDB, and CockroachDB. It provides type-safety, a
Based on the provided social mentions, users view Prisma primarily as an experimental AI/ML tool for research and development purposes. The main strengths appear to be its interpretability features and architecture visualization capabilities, with developers appreciating its potential for understanding model internals and data flow. Key complaints center around it being described as a "crap prototype" by its own creators, suggesting it's still in early development stages with significant limitations. There's no clear pricing sentiment from these mentions, as discussions focus more on technical experimentation rather than commercial use. Overall, Prisma seems to have a niche reputation among AI researchers and developers as an interesting but unpolished tool for model interpretability work.
Zerox
Prisma
Zerox
Pricing found: $50.10, $48.71, $48.71, $48.71, $9.74
Prisma
Pricing found: $0 / month, $10 / month, $0.0080, $2.00, $49 / month
Only in Zerox (10)
Zerox
No data yet
Prisma
Zerox
Prisma