Sync Apify Dataset data into BigQuery with Jitsu

Apify Dataset

BigQuery

Get Jitsu

Jitsu.Cloud is the easiest way to try out Jitsu. Pricing is volume based
Jitsu Open-Source edition is free and can be deployed with any infrastructure provider

About Apify Dataset integration

Apify is a web scraping and web automation platform providing both ready-made and custom solutions, an open-source SDK for web scraping, proxies, and many other tools to help you build and run web automation jobs at scale. The results of a scraping job are usually stored in Apify Dataset. This connector allows you to automatically sync the contents of a dataset to your chosen destination. To sync data from a dataset, all you need to know is its ID. You will find it in Apify console under storages.

About BigQuery

Google BigQuery is a fast, scalable, and easy-to-use data warehouse. Main advantages of Google BiqQuery are:
  • Serverless architecture.
  • Pay-as-you go
Jitsu can stream and batch data to Google BigQuery. Streaming will get data to BQ immediately, however Google charges for each streamed record, while batching is free. Streaming is the fastest way to get started, but batching will be cheaper for large volumes.
© Jitsu Labs, Inc

2261 Market Street #4109
San Francisco, CA 94114