Apache Airflow
Platform created by the community to programmatically author, schedule and monitor workflows.
Apache Airflow® has a modular architecture and uses a message queue to orchestrate an arbitrary number of workers. Airflow™ is ready to scale to infinity. Apache Airflow® pipelines are defined in Python, allowing for dynamic pipeline generation. This allows for writing code that instantiates pipelines dynamically. Easily define your own operators and extend libraries to fit the level of abstraction that suits your environment. Apache Airflow® pipelines are lean and explicit. Parametrization is built into its core using the powerful Jinja templating engine. No more command-line or XML black-magic! Use standard Python features to create your workflows, including date time formats for scheduling and loops to dynamically generate tasks. This allows you to maintain full flexibility when building your workflows. Monitor, schedule and manage your workflows via a robust and modern web application. No need to learn old, cron-like interfaces. You always have full insight into the status and logs of completed and ongoing tasks. Apache Airflow® provides many plug-and-play operators that are ready to execute your tasks on Google Cloud Platform, Amazon Web Services, Microsoft Azure and many other third-party services. This makes Airflow easy to apply to current infrastructure and extend to next-gen technologies. Anyone with Python knowledge can deploy a workflow. Apache Airflow® does not limit the scope of your pipelines; you can use it to build ML models, transfer data, manage your infrastructure, and more. Wherever you want to share your improvement you can do this by opening a PR. It’s simple as that, no barriers, no prolonged procedures. Airflow has many active users who willingly share their experiences. Have any questions? Check out our buzzing slack. Today we re launching the Apache Airflow Registry — a searchable catalog of every official Airflow provider and its modules, live at … The interactive report is hosted by Astronomer. The Apache Airflow community thanks Astronomer for running this survey, for sponsoring it … We are thrilled to announce the first major release of airflowctl 0.1.0, the new secure, API-driven command-line interface (CLI) for Apache … Apache Airflow Core, which includes webserver, scheduler, CLI and other components that are needed for minimal Airflow installation. Read the documentation Apache Airflow CTL (airflowctl) is a command-line interface (CLI) for Apache Airflow that interacts exclusively with the Airflow REST API. It provides a secure, auditable, and consistent way to manage Airflow deployments — without direct access to the metadata database. Read the documentation The Task SDK provides python-native interfaces for defining DAGs, executing tasks in isolated subprocesses and interacting with Airflow resources (e.g., Connections, Variables, XComs, Metrics, Logs, and OpenLineage events) at runtime. The goal of task-sdk is to decouple DAG authoring from Airflow internals (Scheduler, API Server, etc.), provid
Google Document AI
The Document AI solutions suite includes pretrained models for document processing, Workbench for custom models, and Warehouse to search and store.
Create document processors that help automate tedious tasks, improve data extraction, and gain deeper insights from unstructured or structured document information. Document AI helps developers create high-accuracy processors to extract, classify, and split documents. Seamlessly connect to BigQuery, Vertex Search, and other Google Cloud products Enterprise-ready, along with Google Cloud's data security and privacy commitments Built for developers; use the UI or API to easily create document processors Use generative AI to extract data or classify documents out of the box, with no training necessary to get started. Simply post a document to an enterprise-ready API endpoint to get structured data in return. Document AI is powered by the latest foundation models, tuned for document tasks. Also, with powerful fine-tuning and auto-labeling features, the platform offers multiple paths to reach the required accuracy. Structure and digitize information from documents to drive deeper insights using generative AI to help businesses make better decisions. Extract data from your documents using generative AI. For full product capabilities head to Document AI in the Google Cloud Console. Document AI Workbench provides an easy way to build custom processors to classify, split, and extract structured data from documents. Workbench is powered by generative AI, which means it can be used out of the box to get accurate results across a wide array of documents. Furthermore, you can achieve higher accuracy by providing as few as 10 documents to fine-tune the large model—all with a simple click of a button or an API call. With Enterprise Document OCR, users gain access to 25 years of optical character recognition (OCR) research at Google. OCR is powered by models trained on business documents and can detect text in PDFs and images of scanned documents in 200+ languages. The product can see the structure of a document to identify layout characteristics like blocks, paragraphs, lines, words, and symbols. Advanced features include best-in-class handwriting recognition (50 languages), recognizing math formulas, detecting font-style information, and extracting selection marks like checkboxes and radio buttons. Try Document OCR now for accurate text and layout extraction. Developers use Form Parser to capture fields and values from standard forms, to extract generic entities, including names, addresses, and prices, and to structure data contained in tables. This product works out of the box and does not require any training or customization and is useful across a broad range of document customization. Explore document processing with Form Parser. Try out pretrained models for commonly used document types including W2, paystub, bank statement, invoice, expense, US driver license, US passport, and identity proofing. Explore pretrained options in the processor gallery. Document AI is helping customers improve fraud detection, automate customer support, and pro
Apache Airflow
Google Document AI
Apache Airflow
Google Document AI
Pricing found: $300, $1.50, $0.60, $6, $6
Google Document AI (2)
Only in Apache Airflow (4)
Only in Google Document AI (10)
Apache Airflow
Google Document AI