Tellius
  • 🚩Getting Started
    • 👋Say Hello to Tellius
      • Glossary
      • Tellius 101
      • Navigating around Tellius
    • ⚡Quick Start Guides
      • Search
      • Vizpads (Explore)
      • Insights (Discover)
    • ✅Best Practices
      • Search
      • Vizpads (Explore)
      • Insights (Discover)
      • Predict
      • Data
    • ⬇️Initial Setup
      • Tellius architecture
      • System requirements
      • Installation steps for Tellius
      • Customizing Tellius
    • Universal Search
    • 🏠Tellius Home Page
    • ❓FAQs
      • Kaiya Conversational AI
      • Data Preparation FAQs
      • Environment FAQs
      • Search FAQs
      • Vizpads FAQs
      • Data Caching FAQs
      • Embedding FAQs
      • Insights FAQs
  • Kaiya
    • ♟️Understanding AI Agents & Agentic Flows
      • Glossary
      • Composer
      • 🗝️Triggering an agentic workflow
      • The art of possible
      • Setting up LLM for Kaiya
    • 🤹Kaiya conversational AI
      • Triggering Insights with "Why" questions
      • Mastering Kaiya conversational AI
      • 📒Kaiya Learnings
      • Kaiya Terms of Service
  • 🔍Search
    • 👋Get familiar with our Search interface
    • 🤔Understanding Tellius Search
    • 📍Search Guide
    • 🚀Executing a search query
      • Selecting a Business View
      • Typing a search query
      • Constructing effective search queries
      • Marketshare queries
    • 🔑Analyzing search results
      • Understanding search results
      • Search Inspector
      • Time taken to execute a query
      • Interacting with the resulting chart
    • 📊Know your charts in Tellius
      • Understanding Tellius charts
      • Variations of a chart type
      • Building charts from Configuration pane
      • List of chart-specific fields
      • Adding columns to fields in Configuration pane
      • Absolute and percentage change aggregations
      • Requirements of charts
      • Switching to another chart
      • Formatting charts
      • Advanced Analytics
      • Cumulative line chart
    • 🧑‍🏫Help Tellius learn
    • 🕵️‍♂️Search history
    • 🎙️Voice-driven search
    • 🔴Live Query mode
  • 📈Vizpads (Explore)
    • 🙋Meet Vizpads!
    • 👋Get familiar with our Vizpads
    • #️⃣Measures, dimensions, date columns
    • ✨Creating Vizpads
    • 🌐Applying global filters
      • Filters in multi-BV Vizpads
      • Filters using common columns
    • 📌Applying local filters
    • 📅Date picker in filters
      • Customizing the calendar view
    • ✅Control filters
      • Multi-select list
      • Single-select list
      • Range slider
      • Dropdown list
    • 👁️Actions in View mode
      • Interacting with the charts
    • 📝Actions in Edit mode
      • 🗨️Viz-level actions
    • 🔧Anomaly management for line charts
      • Instance level
      • Vizpad level
      • Chart level
    • ⏳Time taken to load a chart
      • Instance level
      • Vizpad level
      • Chart level
    • ♟️Working with sample datasets
    • 🔁Swapping Business View of charts
      • Swapping only the current Vizpad
      • Swapping multiple objects
      • Configuring the time of swap
    • 🤖Explainable AI charts
  • 💡Insights (Discover)
    • 👋Get familiar with our Insights
    • ❓Understanding the types of Insights
    • 🕵️‍♂️Discovery Insights
      • Partitioning for JDBC Datasets
    • ➕How to create new Insights
      • 🔛Creating Discovery Insight
      • 🔑Creating Key Driver Insights
      • 〰️Creating Trend Insights
      • 👯Creating Comparison Insights
    • 🧮The art of selecting columns for Insights
      • ➡️How to include/exclude columns?
  • 🔢Data
    • 👋Get familiar with our Data module
    • 🥂Connect
    • 🪹Create new datasource
      • Connecting to Oracle database
      • Connecting to MySQL database
      • Connecting to MS SQL database
      • Connecting to Postgres SQL database
      • Connecting to Teradata
      • Connecting to Redshift
      • Connecting to Hive
      • Connecting to Azure Blob Storage
      • Connecting to Spark SQL
      • Connecting to generic JDBC
      • Connecting to Salesforce
      • Connecting to Google cloud SQL
        • Connecting to a PostgreSQL cloud SQL instance
        • Connecting to an MSSQL cloud SQL instance
        • Connecting to a MySQL Cloud SQL Instance
      • Connecting to Amazon S3
      • Connecting to Google BigQuery
        • Steps to connect to a Google BigQuery database
      • Connecting to Snowflake
        • OAuth support for Snowflake
        • Integrating Snowflake with Azure AD via OAuth
        • Integrating Snowflake with Okta via OAuth
        • Azure PrivateLink
        • AWS PrivateLink
        • Best practices
      • Connecting to Databricks
      • Connecting to Databricks Delta Lake
      • Connecting to an AlloyDB Cluster
      • Connecting to HDFS
      • Connecting to Looker SQL Interface
      • Loading Excel sheets
      • 🚧Understanding partitioning your data
    • ⏳Time-to-Live (TTL) and Caching
    • 🌷Refreshing a datasource
    • 🪺Managing your datasets
      • Swapping datasources
    • 🐣Preparing your datasets
      • 🤾Actions that can be done on a dataset
      • Data Pipeline
      • SQL code snippets
      • ✍️Writeback window
      • 🧩Editing Prepare → Data
      • Handling null or mismatched values
      • Metadata view
      • List of icons and their actions
        • Functions
        • SQL Transform
        • Python Transform
        • Standard Aggregation
        • Creating Hierarchies
      • Dataset Scripting
      • Fusioning your datasets
      • Scheduling refresh for datasets
    • 🐥Preparing your Business Views
      • 🌟Create a new Business View
      • Creating calculated columns
      • Creating dynamic parameters
      • Scheduling refresh for Business Views
      • Setting up custom calendars
    • Tellius Engine: Comparison of In-Memory vs. Live Mode
    • User roles and permissions
  • Feed
    • 📩What is a Feed in Tellius?
    • ❗Alerts on the detection of anomalies
    • 📥Viewing and deleting metrics
    • 🖲️Track a new metric
  • Assistant
    • 💁Introducing Tellius Assistant
    • 🎤Voice-based Assistant
    • 💬Interacting with Assistant
    • ↖️Selecting Business View
  • Embedding Tellius
    • What you should know before embedding
    • Embedding URL
      • 📊Embedding Vizpads
        • Apply and delete filters
        • Vizpad-related actionTypes
        • Edit, save, and share a Vizpad
        • Keep, remove, drill sections
        • Adding a Viz to a Vizpad
        • Row-level policy filters
      • 💡Embedding Insights
        • Creating and Viewing Insights
      • 🔎Embedding Search
        • Search query execution
      • Embedding Assistant
      • 🪄Embedding Kaiya
      • Embedding Feed
  • API
    • Insights APIs
    • Search APIs
    • Authentication API (Login API)
  • ✨What's New
    • Release 5.4
      • Patch 5.4.0.x
    • Release 5.3
      • Patch 5.3.1
      • Patch 5.3.2
      • Patch 5.3.3
    • Release 5.2
      • Patch 5.2.1
      • Patch 5.2.2
    • Release 5.1
      • Patch 5.1.1
      • Patch 5.1.2
      • Patch 5.1.3
    • Release 5.0
      • Patch 5.0.1
      • Patch 5.0.2
      • Patch 5.0.3
      • Patch 5.0.4
      • Patch 5.0.5
    • Release 4.3 (Fall 2023)
      • Patch 4.3.1
      • Patch 4.3.2
      • Patch 4.3.3
      • Patch 4.3.4
    • Release 4.2
      • Patch 4.2.1
      • Patch 4.2.2
      • Patch 4.2.3
      • Patch 4.2.4
      • Patch 4.2.5
      • Patch 4.2.6
      • Patch 4.2.7
    • Release 4.1
      • Patch 4.1.1
      • Patch 4.1.2
      • Patch 4.1.3
      • Patch 4.1.4
      • Patch 4.1.5
    • Release 4.0
Powered by GitBook

© 2025 Tellius

On this page

Was this helpful?

Export as PDF
  1. Getting Started
  2. FAQs

Data Preparation FAQs

PreviousKaiya Conversational AINextEnvironment FAQs

Last updated 15 hours ago

Was this helpful?

What is a Business View?

A Business View is a logical data model. Consider a logical collection of tables that are linked by a common key (date, ID etc). When building a Business View, one has to build a dataset for modeling rather than creating a relational database.

What happens if Business Views are downloaded as CSV?

The portion displayed under Data -> Business View will be downloaded as a single table.

What is the difference between a Business View and a Project?

A Business View is the logical data model used to drive Search, Vizpads, Insights, and Models. A Project is akin to a folder that contains a collection of Vizpads, Insights, and Models for a particular Business View. Once a project is shared with other users, all of its contents are shared automatically.

Can I create a new dataset within Tellius using SQL or Python?

No, you can only add or remove columns in the existing dataset. A self-join of datasets is possible in some instances.

What is the difference between column type and feature type?

Column types can be measures or dimensions. Feature types can be categorical or continuous. Check out the documentation on and .

How do I refer to null values in the signature function? I have tried multiple versions, and none work (null, NULL, “NULL”, “null”).

For this purpose, you need to use SQL or Python. Functions do not support reference to null values.

How do I set the incremental data refresh, i.e., do not replace it, but append it during a refresh?

Only full replacement is available for local files. Incremental can be set up for databases.

Where is the data located after the connection is established?

The data is loaded into memory and also stored on disk.

How to handle actual time stamp columns? For example, converting a string "12:12:11" to a timestamp, using hh:mm:ss format does not work as the date is set to Jan 1, 1970.

This manipulation need to be done using SQL or Python.

If I need to go back into the data pipeline and modify a node, will the subsequent node be updated?

Yes. SQL, Python, and advanced filter nodes can be modified in the middle of the data pipeline without having to modify the subsequent node. If a node is no longer needed, keep the node without a script.

The values with dollar signs are imported as strings. How to handle the transformation into a num?

It's recommended to remove the dollar sign, comma, and periods (for decimals) using find & replace via SQL or Python. Then, it needs to be converted into a number. Or, you could directly convert it into a number via SQL or Python.

What is Live query mode? What are the cons of that approach?

Live datasets reside in the source, and they are not imported to Tellius. So for each query, the data needs to be fetched from the source, which could take a bit of time (when compared to non-live datasets). It can only be used for driving Search and Vizpads, but not Insights or ML models.

What are the databases that work with the Live query engine?

Redshift, Snowflake, Bigquery, Teradata work with Live query engines.

What is the difference between measures and dimensions?

Is there a way to see an audit trail of each transformation made on the dataset so I can undo/revert changes?

Yes. Data Pipeline contains the audit trail of all transformations done to a dataset. The pipeline is tied to a dataset. So, if a dataset is deleted, its pipeline will also be deleted.

Can we incorporate formulas, nested queries, or complex joins within the search criteria and join them with specific keywords?

It is absolutely possible to create formulas, nested queries, and complex joins, which can be used in search queries. New business KPIs and calculated metrics can also be created and used in Search and Vizpads.

Does Tellius support the loading of large data sets in bulk? If so, what are the relative speeds?

Yes. Tellius supports live connections and loading in bulk. The speed depends on the size of the data, resources for the instance, and underlying data source.

When using a SQL script for a live connection, how does it run, and what's the impact on performance? For instance, will a window function in the script execute with every front-end query?

In live connections, any SQL script, including a window function, will execute with each Search/Vizpad query run. To avoid this, consider creating a Materialized View on the source system and then load it to Tellius.

What SQL dialect is used for calculated columns within live connections?

The SQL dialect for calculated columns is compatible with both Spark and the source system from which the data is loaded.

What is the expected behavior when the FQE is toggled on for a live Business View (BV)?

Enabling FQE on a live BV is not applicable. If toggled on, the system will not ingest the data into FQE; live connection status will be maintained. Please avoid using the FQE toggle for live BVs.

Measures are quantitative values (e.g., profit, sales, weight) for which you can apply aggregation. Dimensions are qualitative values (e.g., city, name) for which aggregation cannot be applied. Check out for more details.

🚩
❓
column type
feature type
this section