Zerox
OCR & Document Extraction using vision models. Contribute to getomni-ai/zerox development by creating an account on GitHub.
A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense! Zerox is available as both a Node and Python package. (Node.js SDK - supports vision models from different providers like OpenAI, Azure OpenAI, Anthropic, AWS Bedrock, Google Gemini, etc.) The maintainFormat option tries to return the markdown in a consistent format by passing the output of a prior page in as additional context for the next page. This requires the requests to run synchronously, so it's a lot slower. But valuable if your documents have a lot of tabular data, or frequently have tables that cross pages. Zerox supports structured data extraction from documents using a schema. This allows you to pull specific information from documents in a structured format instead of getting the full markdown conversion. Use extractPerPage to extract data per page instead of from the whole document at once. Zerox supports a wide range of models across different providers: (Python SDK - supports vision models from different providers like OpenAI, Azure OpenAI, Anthropic, AWS Bedrock, etc.) The pyzerox.zerox function is an asynchronous API that performs OCR (Optical Character Recognition) to markdown using vision models. It processes PDF files and converts them into markdown format. Make sure to set up the environment variables for the model and the model provider before using this API. Refer to the LiteLLM Documentation for setting up the environment and passing the correct model name. Note the output is manually wrapped for this documentation for better readability. This project is licensed under the MIT License. OCR Document Extraction using vision models There was an error while loading. Please reload this page. There was an error while loading. Please reload this page. There was an error while loading. Please reload this page. There was an error while loading. Please reload this page.
Airbyte
Explore Airbyte, your go-to data integration platform and ELT tool. Seamlessly integrate, transform, and load data with our powerful, user-friendly so
Agent Engine Public Beta is now live! Airbyte provides the data infrastructure layer for ELT and AI agents, built on the same open-source foundation. Use batch and CDC replication for analytics, or direct connectors and context store to power agent workflows. Immediate ROI and productivity gains for your data teams. "With our legacy framework, if one of the pipelines fails for one client, it will stop everything for the rest of our clients. But with Airbyte, things are run in parallel because of the platform’s distributed nature, which means that we can process multiple clients at the same time without impacting performance." Raman Singh, Tech Lead at Symend "The real ROI is in our ability to iterate quickly, especially at our increasing scale. At the end of the day, you want a tool like that to just work. We can forget about it and know that it's configured and it's connecting and it's working. That hands-free capability is a big appeal for the platform.” Sean Carver, Director of Data at Petvisor "Unlike Fivetran's credit-based system that created budget uncertainty, Airbyte's pricing model allows Kuda to forecast expenses accurately and avoid surprise bills." Mondor La Grange, Head of BI and Data Engineering "What's different from Stitch Data or Informatica is the way that we can configure Airbyte connections and Airbyte entities through code. That's a huge plus to us as data engineers, because we are used to checking code and being able to manage changes from Github." Amy Zhao, Senior Manager of Data Engineering at Peloton "Airbyte allows us to stay flexible while scaling from hundred-million to billion-dollar enterprise clients." Franziska Ibscher, Product Manager at Drivepoint Data pipeline jobs synced in the last 24 hours The world's leading enterprises choose Airbyte to ensure data sovereignty and safeguard operations with enterprise-grade security-backed by dedicated support and SLAs they can trust. Cloud deployment with PrivateLink and multiple data region optionality provides flexibility for global businesses. Single Sign-On (SSO), SCIM provisioning, fine-grained RBAC, audit logs, and enterprise encryption standards. SOC 2 Type II certified, GDPR and HIPAA support, with tools to help you meet internal and external regulatory requirements. Optimized for high-throughput, low-latency pipelines that handle millions of records reliably at enterprise scale. 24/7 support, named customer success managers, and proactive monitoring to keep pipelines healthy. 99.9% availability backed by contractual SLAs and priority response times for mission-critical workloads. As the open source standard for the industry, Airbyte features an unparalleled developer community and resources for all users to achieve success. With over 25,000 users, our community is always ready to answer questions and offer tips to new users. Access our developer resources, such as user guides, blogs, videos and documentation. Explore our schedule of events
Zerox
Airbyte
Zerox
Pricing found: $50.10, $48.71, $48.71, $48.71, $9.74
Airbyte
Pricing found: $10 / month, $49 /month, $149 /month, $0.01 /call, $49/month
Only in Zerox (10)
Only in Airbyte (10)
Zerox
Airbyte