Announcing a Firestore Connector for Apache Beam and Cloud Dataflow

By mullaned2002

November 8, 2021

688

Large scale data processing workloads can be challenging to operationalize and orchestrate. We’re excited to announce the release of a Firestore in Native Mode connector for Apache Beam to make data processing easier than ever for Firestore users. Apache Beam is an open source project that supports large scale data processing with a unified batch and streaming processing model. Beam is portable, works with many different backend runners, and allows for flexible deployment. The Firestore Beam I/O Connector joins BigQuery, Bigtable, and Datastore as Google databases with Apache Beam connectors. The Firestore I/O Connector is automatically included with theGoogle Cloud Platform IO module of the Apache Beam Java SDK.

The Firestore connector can be used with a variety of Apache Beam backends, including Google Cloud Dataflow. Dataflow, an Apache Beam backend runner, provides a structure for developers to solve “embarrassingly parallel” problems. Mutating every record of your database is an example of such a problem. Using Beam pipelines removes much of the work of orchestrating the parallelization and allows developers to instead focus on the transforms on the data.

The Firestore connector can be used in a simple way, the same way you would use other Beam connectors:

There are many possible applications for this connector for Google Cloud users. Joining disparate data in a Firestore in Native Mode database, relating data across multiple databases, deleting a large number of entities, writing Firestore data to BigQuery, and more. We’re excited to have contributed this connector to the Apache Beam ecosystem and can’t wait to see how you use the Firestore connector to build the next great thing.

Cloud BlogRead More

Previous articlePredict hospital readmission rates with Google Cloud Platform

Next articleHow Looker is helping marketers optimize workflows with first-party data

Announcing a Firestore Connector for Apache Beam and Cloud Dataflow

The overwhelmed person’s guide to Google Cloud: week of April 18

Your insider’s guide to Google Cloud Security at RSA Conference 2024

AI will break the stagnation in developer productivity, but only if you do it right

LEAVE A REPLY Cancel reply

Most Popular

The overwhelmed person’s guide to Google Cloud: week of April 18

AI will break the stagnation in developer productivity, but only if you do it right

Your insider’s guide to Google Cloud Security at RSA Conference 2024

2024 DORA survey now live: share your thoughts on AI, DevEx, and platform engineering

Recent Comments

EDITOR PICKS

Exploring the Click Element Variable in Google Tag Manager

How to track events with Google Tag Manager and Google Analytics

Data Layer Variable in GTM: What, Why, and Where?

POPULAR POSTS

Improved Alerting with Atlas Streaming Eval

A new Optimize feature to keep your website updated through COVID-19

Migrate a multi-TB SQL Server database to Amazon RDS Custom for SQL Server using Amazon S3 and Amazon EBS

POPULAR CATEGORY