List of icons and their actions
Descriptions and purpose of each icon found under Data -> Prepare
Last updated
Was this helpful?
Descriptions and purpose of each icon found under Data -> Prepare
Last updated
Was this helpful?
Below is a concise reference guide for each icon found above the data pipeline.
Reverts your latest pipeline change (e.g., removing a column, adding a step).
If you used Undo but decide to keep that action, click Redo. The undone step reappears.
Discards all pipeline changes and rolls back the entire dataset to its original, pre-transformation state.
Summarize or group data by performing aggregations (COUNT, SUM, AVERAGE, etc.). It has the following tabs:
Standard: Basic grouping by field, plus a date/time column if needed.
Time-Series: Specifically for date/time-based grouping (with โResolutionโ like daily, weekly).
Pivot: Create a pivot table by selecting row/column dimensions and an aggregation measure.
Build complex multi-condition filters for the dataset pipeline. Choose the required column and the action to be applied. Use "+" to chain multiple conditions for more nuanced data curation. Once the filters are applied, only rows matching these conditions will remain in your pipeline.
Displays options to export or write back the current dataset to a target (e.g., CSV, HDFS, Snowflake). For more details, check out page.
Performs advanced formula-based transformations, subdivided into Indicators and Signatures. For more details, check out page.
Allows you to write or apply raw SQL transformations in the data pipeline. Ideal for complex joins, subqueries, and pushdown operations that merge multiple datasets or apply advanced filtering/aggregations directly within the databaseโs query engine. For more details, check out page.
Allows you to write or apply Python script-based transformations (via PySpark or Pandas) in the data pipeline. For example, sophisticated data wrangling, machine learning feature prep, or text processing. Use PySpark for large-scale distributed data or Pandas for smaller local data. For more details, check out page.
Defines hierarchical relationships and lets you organize dimension columns into logical levels, such as Country โ State โ City. For more details, check out page.