Data Pipeline Features
This feature enables a Provider Data Scientist /Business Analyst/Data Engineer to execute the following actions in the Bristlecone NEO® Platform
- Create Data Pipeline
- Delete Data Pipeline
- Refresh: Refreshes the Data Pipeline list to populate the most recently created Data Pipelines
- Export Data: Exports all the pipeline list in Excel format
- Column Chooser: Displays selected columns
- Run a Data Pipeline: On demand trigger a Data Pipelinee
- Schedule a Data Pipeline: The Data Pipeline can be scheduled to be trigged at a specified
- Clone Data Pipeline: Copy/Clone an existing pipeline
- Edit Data Pipeline
Create Data Pipeline
The following are the parameters to create a Data Pipeline:
- Pipeline Name: A unique name of the Pipeline to be created
- Pipeline Description: A brief description on the Data Pipeline to be created
- Data Pipeline Type: There are two types of Data Pipeline Executions
- Batch : Enables the user to design the Data Pipeline to run in a specific timeline
- On File Drop: Enables the user to design the Data Pipeline for real time data ingestion
- Data Source Path: The source path from where the Data Pipeline execution will trigger
- Execution Flow: Define Execution Flow
Ex: 12 a.m. on Every Monday
Ex: IOT, web stream data
For expansive features and functionalities, please refer to Creating a Data Pipeline section
Delete Data Pipeline
To delete a Data Pipeline, click on individual Delete icon and click on Yes Button to confirm the deletion
Refresh Data Pipeline
To refresh data pipeline list, click on Refresh icon
Export all Pipeline
To export all the pipeline detailed list in excel format, click on Export icon. The list will be downloaded to local system
Column Chooser
This feature displays the columns where the checkboxes are selected inside column chooser panel as shown below
Run a Data Pipeline
This feature enables a Provider Data Scientist /Business Analyst/Data Engineer to trigger a Data Pipeline on Demand
For expansive features and functionalities, please refer to On Demand Trigger section on Triggering a data Pipeline module
Schedule a Data Pipeline
This feature enables a Provider Data Scientist /Business Analyst/Data Engineer to schedule a Data Pipeline to be trigged at a specified date/time
For expansive features and functionalities, please refer Scheduled Trigger section on Triggering a data Pipeline module
Clone a Data Pipeline
This feature enables a Provider Data Scientist /Business Analyst/Data Engineer to copy/clone an existing pipeline
For expansive features and functionalities, please refer to Clone Pipeline section in Creating a data Pipeline module.
Edit Data Pipeline
Step 1: To edit a Data Pipeline, click on Edit icon of the individual Pipeline
Step 2: Update the parameters as needed and click on Update button