Skip to content

Connecting with Apify

The goal of this document

Here, we are going to connect the vendor called Apify to the Airbyte then it will redirect the data using the destination of MySQL, User can choose the format of the destination as required, For example from MySQL/ PostgreSQL, etc.

Useful Resources of Airbyte

Requirements to get the ‘apify’ source

Steps to Fetch the Dataset ID from apify.com

Here, we are going to fetch the Dataset Id instead of API key to make connection between the vendor and Airbyte.

  1. To fetch the Access Token of Apify, you should go to https://console.apify.com/sign-up

Image

  1. You need to login with your credentials.

Image

  1. Click on Storage option from left-menubar, as shown in figure below:

Image

  1. After getting on the storage page, click on datasets button to get in the dataset section of apify.com

Image

Steps to Connect apify.com with AIV

  1. Go to the Airbyte using the below link: https://airbyte.com/

  2. The Airbyte landing screen looks like below:

Image

  1. Here We have 3 options on the left, which contain Connections, Sources, and Destination. As per continuing the steps click on the Sources option.

Image

  1. To add the apify source, click on the New Source button from the Top-right corner.

Image

  1. To add Any of the sources here, the user needs to add the properties which are needed from the Airbyte, every source has its own individual properties.

Image

  1. Set up source Form Overview
  • Name: Users can add the name of the source as per their requirements.
  • Source Type: The user has to select the source type from the provided list by airbyte.
  • Dataset ID: Fetch the dataset id from apify.com and add here.

Image

  1. Now, set up the source here,
  • Add the name as Apify,
  • Select Apify Dataset from the source,
  • Add Dataset ID in the form,
  • Validate your form with as shown in the figure below:

Image

  • Now click on Set up Connection, it will test the connection here, then it will redirect to add destination of the source, as per figure below:

Image

Test connection successful alert.

Image

  1. After testing the connection successfully add the destination here.

Image

Here, on the source adding screen,

  • The top-menu bar shows two buttons where 1. for Overview of the source and 2. for the settings of the source.
  • Overview The overview screen shows the details related to the source. it may be empty as per the above screen.
  • Settings: From settings, the user can edit the form details of the added source.
  • Add destination button leads us to the destination page from this source page.
  1. Click on the Destination from the Top-right corner, as shown in the figure below:

Image

  1. Click on the MySQL Training here. it may take some time to fetch the stream names, as shown in figure below:

Image

  1. After loading all the Streams form the Vendor’s to the destination, the destination page looks as shown in the figure below:

Image

  1. After loading all the data streams user need to add Sync frequency and Table Prifix, as shown in the figure below:
  • Add Sync Frequency: Manual
  • Add Table Prefix: Apify_
  • As shown in the figure below:

Image

  • When the user add the prefix, it gets added on Destination stream name automatically, as shown in the figure below:

Image

  1. At the bottom of the destination page, we have two radio buttons, which contain the Normalization option. Keep the Basic Normalization selected.

Image

  1. Now, click on the setup connection button to complete the connection. it will test the connection again which may take some time.

  2. Now, to validate the data is synced by the Airbyte, Go to the Connection from the Left menu bar of the screen:

Image

  1. Find the Added connection of the Apify from the list,

Image

Here, the Data isn’t synced yet, for that it shows the icon of non-sync at the start of the row. click on the row to see the details.

  1. Here the Apify connection page will show the Status of the data sync and sync history.

Image

  1. The title of the Source and destination name.

  2. Status and settings page top menu bar.

  3. Enable button: from this button, the user can enable/disable the source from the destination.

  4. Reset your data and Sync button: the user can reset their data and only sync the updated data from the vendor.

  5. Click on the sync button to sync the data manually, after clicking on the sync button it will start the process which will indicate the status under the history grid.

Image