Skip to main content

Dataiku Connector User Guide

Overview

The Dataiku Connector is used to connect the Dataiku Data Science Studio (DSS) with the Koverse Data Platform (KDP).

Installation

Prerequisite for using the Dataiku Connector is the installation of Dataiku (DSS). Installation instructions are available below:

The plugin can be installed from the Add Plugin page in Dataiku DSS:

Fetch From GitHub Repository

Repository URLs:

git@github.com:Koverse/dss-plugin-kdp4.git

or

https://github.com/Koverse/dss-plugin-kdp4.git

Add Plugin from GitHub Repository

The next step is to create a code environment for the plugin, which installs needed dependencies:

  • kdp-python-connector
  • kdp-api-python-client

Add Code Environment

After installation of the plugin, The Koverse KDP plugin will be accessible from the DATASET menu of the Flow in Dataiku DSS. Example:

Koverse KDP Plugin on Flow Page

Create Dataiku Preset

Under the Settings tab, create a preset for API configuration.

Plugin Settings

There are two supported authentication methods, basic_login and json web token. The KDP plugin will use the method you choose for authentication when connecting to KDP.

Basic Login

Preset Basic Login

KDP JSON Web Token

Preset JWT

Read/Write Data from Dataiku DSS to KDP

Select the Koverse KDP Dataset which is a Dataiku custom dataset for reading and writing data to KDP.

Select Dataset

Provide Required Parameters and Name for the Dataset

When using an existing KDP dataset, the existing data can be previewed:

Provide Export Dataset Parameters

When using the dataset to create a new KDP4 dataset, you can provide the dataset name.

Creating New KDP4 Dataset

After creation of the new dataset, you can select use_an_existing_dataset and put in the dataset_id of the new KDP4 dataset to preview the data.

Preview Dataset Data

For Development

Creating New Version of the Plugin

You can test changes outside DSS with the use_connector example (following the instructions contained in that example), or add additional examples/tests in the same pattern. It will require the dependencies listed in requirements.txt to be installed (code-env/python/spec/requirements.txt).

From kdp-Dataiku-connector root... (may require sudo, or use of venv)

pip install -r code-env/python/spec/requirements.txt

Usage

Testing in Dataiku DSS can be done by importing the plugin as detailed in the steps above. A feature branch can be targeted and imported. Additionally, once the plugin is installed, it can be converted to a development plugin in DSS by selecting the menu option from ACTIONS as seen here.

Convert to Developer Plugin

You can then edit in the DSS application. You will have to refresh any open DSS page after saving changes. DSS does have git integration; you can push changes out from DSS to the feature branch as you work with the plugin. The version should be manually updated in the plugin.json to the new minor version and also update the version in setup.py.