Ada

Semantics

How to Connect Data

1. Supported Data Sources

You can connect the following types of data sources:

  • JDBC (Java Database Connectivity):
    • MySQL
    • PostgreSQL
  • Local Files:
    • Local files: Upload .csv, .txt, .xlsx, .xls formats from your computer..
    • S3 Data Source: Configure S3 connection with your Region, Endpoint, Bucket Name, Access Key Type, Access Key ID, Secret, and File Path.

2. Add a new data source

  • Navigate to Semantic Studio from the main navigation menu.

  • Click on Data Connection under the Semantic Factory section to manage data sources.

  • Click Connect Data to add a new data table.

![Connet Data-EN-20250709](/images/Connet Data-EN-20250709.png)

  • Once a data source is connected or uploaded, you can edit the table name, the type and description of each column.

EditTable-EN-20250709

3. View & Manage Data Sources

  • All connected data sources will appear in the Data Source.

  • From here, you can:

    • Search: Search by Table Name

    • View Details: View the data schema and sample rows

    • Delete or Disable: Remove inactive or outdated sources

![Data Sources-EN-20250709](/images/Data Sources-EN-20250709.png)

How to Build Semantics

1. Overview:

Semantics translates raw tables and fields into reusable dimensions and metrics, making it easy for Ada to assist you with data analysis and natural language queries.

  • Semantic Model: A virtual data model built from one or more connected tables. It defines how your data is joined, how business concepts are mapped to fields, and how metrics are calculated.

  • Dimension: Attributes you can use to break down or filter data. (e.g. Time, Categories, Status.)

  • Metric: Numerical values used to measure business performance. You can create:

    • Basic Metrics: direct aggregations like SUM(Sales) or COUNT(Users)
    • Composite Metrics: calculated from other metrics, e.g., Profit Margin = (Revenue - Cost) / Revenue

2. Create a Semantic Model

  • Click “New Model” to begin building your semantic layer.

  • Select one or more tables from your connected data sources.

  • Click “+Associate Table” to join another table. You can choose the join types and joined dimensions here.

Join

3. Create Dimension

  • Click “+Create Dimension” to add dimensions.

  • Select Data Source and Dimension Calculation. Add more Dimension by clicking "Add Custom Dimension".

![Create Dimension](/images/Create Dimension.png)

4. Create Metric

  • Click “Create Metric to add Metrics. You can enter Name, Description and Category for it. You can also adjust the Number Format Settings on its format type, unit, decimal place or thousand separator.

  • Choose the configuration mode between Basic Metric and Composite Metric:

    • To create a Basic Metric, select a saved data source, a calculation field, and its aggregation mode. You can also enable the Point Time Attribute to make the metric time-aware and restrict it to data available at that specific point in time.
    • To create a Composite Metric, define a formula using one or more existing metrics. You can apply arithmetic operations (e.g. +, -, *, /) to combine metrics into a new calculated result.

![Create Metric](/images/Create Metric.png)

5. Manage Models/Dimensions/Metrics

  • All created models/dimensions/metrics are stored in the Semantics in "Semantics Factory", where you can:
    • Search, view and edit existing models/dimensions/metrics.
    • Categorize metrics by adding "New Category" or editing existing categories.

Smart Semantic Documentation

Overview

Smart Semantic automates the creation of a semantic layer (tables, data models, dimensions, metrics) from your uploaded data files (Excel/CSV) using large language models (LLMs). Once processed, you can instantly query the data using natural language to uncover insights—no manual modeling required.

Key Benefits

  • Time Efficiency: Skip manual schema design.

  • Zero Modeling Expertise Needed: LLMs handle data interpretation.

  • Instant Querying: Ask questions immediately after processing.

  • Broad Use Cases: Analyze sales data, survey results, logs, etc.

Workflow

  1. Upload Files

    • Supported Formats: .xlsx, .xls, .csv (max 100MB/file).
    • Requirements:
      • Clear headers (Row 1 must contain column names).
      • Consistent data types per column (e.g., avoid mixing text/numbers in one column).
    • How to Upload:
      • Option 1: Chat Interface Upload. Locate the upload icon in the chat input section, after clicking you can upload files.
      • Option 2: Semantic Studio Module. Navigate to Semantic Studio → Click "Smart Semantics" button and upload files.

    ⚠️ Note: Sensitive data? Use pseudonymization before uploading.

  2. Submit for Processing

    • Click Submit after uploading.

    • The system validates file structure (e.g., checks for empty cells, encoding issues).

  3. Wait for Results

    • Progress Tracking: You can check the status by hovering on the button "AI Model Creation Progress".

    • What Happens:

      • LLMs analyze headers, sample rows, and relationships.
      • Output includes:
        • Tables & Data Models: Logical groupings (e.g., Orders, Customers).
        • Dimensions: Categorical fields (e.g., Product Category, Region).
        • Metrics: Aggregations (e.g., Total Sales = SUM(Revenue)).

    ⏱️ Processing Time: Scales with file size (1k rows ≈ 30 seconds).

  4. Use the Semantic Layer

    Once ready:

    • Explore Semantics:
      • Navigate to Semantics to review auto-generated tables/dimensions/metrics.
      • Edit names, data types, or relationships if needed (optional).
    • Ask Questions:
      • Type natural language queries in the Chat: "Show total sales by region last quarter" → Converts to SQL/visualizations.
      • For supported question formats, please refer to the chapter "For knowledge users".

Example Use Case

File: customer_feedback.csv

  • Automatically Generated Semantics:

    • Data Model: Feedback
    • Dimensions: Product_ID, Response_Date, Sentiment (auto-detected as text categories).
    • Metrics: Complaint_Count = COUNTIF(Response_Type = 'Complaint').
  • Query: "Trend of complaints by product for 2023" → Line chart + table output.

Troubleshooting

IssueSolution
"Processing failed"Re-upload with standardized headers and no formula errors.
Incorrect metricsEdit the metric logic in Data Models > Metrics.
Slow queriesUse filters (e.g., date ranges) to narrow large datasets.
Unrecognized columnsEnsure headers are descriptive (avoid "Column A").

Configuration

Customize to align with your analytical workflow through these settings:

1. Enable Follow-Up Questions

Control whether LLMs suggest next-step queries after each response. Toggle State:

  • ON (Default): After answering a query, the system generates relevant follow-up questions.

  • OFF: Only the direct answer is provided.

2. Semantic Recall Sensitivity

Adjust how strictly semantic elements (metrics/dimensions/filters) are recalled during queries.

  • Match Sensitivity: Control how strictly elements are recalled in queries. Provide 4 scales, including Blur, Medium, High, Exact.

  • Max Returned Matches:

  • Sets a cap on potential matches per element type.

  • Example: If set to 3 for "sales", returns only the top 3 matches:

Sales_Amount, Online_Sales, Store_Sales