Skip to content

Connection Selection Tab Terminology

The Connection Selection tab is the second step in the Create Dataset workflow. After you choose a data source type, this tab guides you through selecting or configuring the specific connection or file that will supply data for your dataset.

This tab helps you:

  • Connect to existing database connections, cloud sources, or file-based data
  • Filter and search available connections when many are configured
  • Create new connections when no suitable datasource exists
  • Upload or select files for Excel, CSV, and JSON sources
  • Proceed to query or configuration based on the selected source type

Refer to the dataset creation guide to navigate to the Data Source Selection step and then access the Connection Selection tab.


Connection Selection by Source Type

The options and steps on the Connection Selection tab depend on the data source type you selected in Step 1. The following sections describe each workflow.


1. Database and Connected Sources

Purpose

This workflow applies when your data comes from a relational database, NoSQL store, or a connected cloud or endpoint service. You either select an existing connection or create a new one.

Applies To

  • Existing Connections (JDBC, MySQL, PostgreSQL, Oracle, SQL Server)
  • NoSQL
  • Google Sheets
  • Google BigQuery
  • SharePoint
  • Data Endpoints

Steps

  1. The system displays all configured connections for the selected source type.

  2. Use the Search Bar to filter connections by name or type.

  3. Select an existing connection (for example, SampleDB from the list of available database connections).

    Connection Selection – Database and Connected Sources

    OR

  4. If no suitable connection exists, click Create Datasource to configure a new connection.

    Create Datasource

  5. Click the selected datasource (for example, SampleDB).

  6. The system navigates to Step 3 – General Tab (Query Configuration).

Key Points

  • Only connections compatible with the chosen data source type are shown.
  • Search helps when many connections are configured.
  • Creating a new datasource may require connection details and credentials.

2. Excel, CSV, and JSON

Purpose

For file-based sources (Excel, CSV, JSON), you provide the actual file either by uploading it or by choosing a previously uploaded file.

Steps

  1. Click the Upload button and select a file from your local machine, or drag and drop the file into the upload area.

    Upload File for Excel, CSV, or JSON

    OR

  2. Select a previously uploaded file (for example, retail_sales.csv) from the list.

    Select Previously Uploaded File

  3. Click the selected file (for example, retail_sales.csv) to proceed to Step 3 – General Tab (Query Configuration).

Key Points

  • Supported formats include Excel (.xlsx, .xls), CSV, and JSON.
  • Uploaded files are retained for reuse in other datasets.
  • Ensure the file structure and encoding match your expectations before proceeding.

3. Parquet

Purpose

When Parquet is selected as the data source type, no separate connection or file selection step is required on this tab. The system takes you directly to query configuration.

Behavior

The system opens Step 3 – General Tab directly, where you can write a SQL query to retrieve and transform Parquet data.

Connection Selection – Parquet

Key Points

  • Parquet sources are configured at the query level in the General Tab.
  • Ensure Parquet storage or catalog connection is available as per your environment setup.

4. Flat Files (Manual Entry Only)

Purpose

When Flat Files is selected, the system skips connection selection and opens the data editor so you can enter or paste structured data manually.

Behavior

The system opens the data editor in Step 3 – General Tab, where you can type or paste structured data in formats such as CSV, Excel, TSV, PSV, or JSON.

Connection Selection – Flat Files

Key Points

  • Ideal for small, ad-hoc datasets or quick prototyping.
  • Data is entered or pasted directly; no file upload or connection is required.
  • Ensure column separators and structure match the chosen format.

5. API Endpoint

Purpose

When API Endpoint is selected as the data source type, the system takes you directly to the configuration screen where you define how the API will be called.

Behavior

The system opens Step 3 – General Tab, where you configure API details such as URL, HTTP method, headers, parameters, and request body.

Connection Selection – API Endpoint

Key Points

  • No pre-configured connection is required; API details are entered in the General Tab.
  • Have the API documentation (URL, method, auth, parameters) available before configuring.
  • Use Preview or test options to validate the API response before saving the dataset.

Summary

The Connection Selection tab ensures that you:

  • Select or create the right connection for database and connected sources
  • Upload or choose the correct file for Excel, CSV, and JSON
  • Proceed appropriately for Parquet, Flat Files, and API Endpoint without extra connection steps

By completing this step correctly, you set the foundation for configuring queries and output in the General and Output Columns tabs.