Guides → Configure Tenants

You can manage your tenants through the Cluster Management Console (CMC) by configuring settings such as the minimum password length, default language, and maximum cached memory. Here are the steps to configure your tenants:

  • Sign in to the CMC.
  • Configure your tenants as follows:
    • To set the configurations of a specific tenant, select Clusters > Cluster-name > Tenants, and then for the tenant row, select Configure.
    • To set your default tenant configurations, select Cluster Configurations > Default Tenant Configurations.

The following tenant configurations are available in the CMC:

Warning: Save Changes

To avoid losing data after configuring your settings on a page, select Save before navigating to another page.

Important: Cloud Admin Limited Access

Some configurations are not available when using the cloud admin account to sign in to the CMC. These configurations are marked on this document with an asterisk (*). Contact Incorta Support to have any of these configurations set or altered.

Security

The security configurations include the authentication types and other important settings within Incorta Analytics, such as enabling Super User mode and support for password-protected Excel files.

The following table illustrates the security configuration properties:

Configuration Property | Control | Requires Restart | Description
Enable Inviting users | toggle | No | Enable this property to invite users by email. The Super User tenant admins and users with a SuperRole access right can invite external users to join their tenants.

Note the following:
  • The Super User tenant admin must properly set the Email configurations for the respective tenant.
  • The inviter must be assigned to at least one group.
  • This property is enabled by default for trial users and disabled for customers.
For more information, refer to Security Manager → Manage Invitations.
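Authentication Type | drop down list | No | Select the authentication type you support from the drop down list. The available types, each described in its own section of this document, are:
  • Incorta Authentication
  • SSO
  • LDAP
  • Azure AD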
Allow Different Authentication Type for External Tools | toggle | Yes, all services | Enable this property to authorize a different authentication type for external tools, and then select the authentication type from the External Tools Authentication Type drop down list. The supported types are:
  • Incorta Authentication
  • LDAP
  • Azure AD
Enable Super User Mode | toggle | No | Enable this property to allow non-Admin users with the SuperRole to access all entities within a tenant. Disable this property to restrict all non-Admin users to shared entities only.
Note: This option does not affect the access to folders and dashboards. Any user, including the Super User and users with the SuperRole, must own or have access rights to a dashboard or folder to view it in the Content Manager.
Enable Support for Password-Protected Excel Files | toggle | No | Enable this property to support the upload of password-protected Excel files using the Data Manager.

Prerequisite: You must install Java Cryptography Extension (JCE) to use this feature.

Here are the steps to install JCE:
  ●   Download and extract JCE from http://www.oracle.com/technetwork/java/javase/downloads/jce8-download-2133166.html
  ●   Copy the JCE jars (local_policy.jar and US_export_policy.jar) into $JAVA_HOME/jre/lib/security
  ●   Restart Incorta CMC.
Download insights | toggle | No | Disable this toggle to prevent users with the User or Individual Analyzer role from downloading insights.

Incorta Authentication

The following table illustrates the configuration properties of the Incorta authentication.

Warning

The default password policy has changed starting 2022.10.0. The default policy will be applied only to newly created passwords, whether for new users creating their first passwords or existing users changing their passwords. However, this will not require that existing users should change their passwords. The new password policy mandates the following:

  • The minimum password length is 12 alphanumeric characters.
  • A password must contain at least one uppercase letter, one lowercase letter, one special character, and one digit.
  • A password cannot include the user login name or email address.
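The default policy above can be expressed as a small check. This is only a sketch for reasoning about the rules, not Incorta's implementation: the function name `policy_violations` is hypothetical, "special character" is assumed to mean ASCII punctuation, and the login and email comparison is assumed to be case-insensitive.

```python
import string
from typing import List


def policy_violations(password: str, login: str, email: str) -> List[str]:
    """Return the default-policy rules that `password` breaks (empty list = compliant)."""
    problems = []
    if len(password) < 12:
        problems.append("shorter than 12 characters")
    if not any(c.isupper() for c in password):
        problems.append("no uppercase letter")
    if not any(c.islower() for c in password):
        problems.append("no lowercase letter")
    if not any(c.isdigit() for c in password):
        problems.append("no digit")
    # Assumption: "special character" means ASCII punctuation.
    if not any(c in string.punctuation for c in password):
        problems.append("no special character")
    # Assumption: the login/email check ignores case.
    lowered = password.lower()
    if any(part and part.lower() in lowered for part in (login, email)):
        problems.append("contains login name or email address")
    return problems
```

For example, `policy_violations("Tr1cky!Passphrase", "jdoe", "jdoe@example.com")` returns an empty list, while a password equal to the login name fails several rules at once.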
Configuration Property | Control | Requires Restart | Description
Authentication Type | drop down list | No | Select Incorta Authentication to sign in to Incorta using the Incorta Analytics username and password. You can also configure password rules and other security-related options.
Minimum Password Length | spin box | No | Enter the minimum number of characters for a password. The default is 5 characters.
Password Cannot Include Username | toggle | No | Enable this property to restrict use of a username in passwords.
Require Lower Case Letters | toggle | No | Enable this property to require lower-case letters in passwords.
Require Upper Case Letters | toggle | No | Enable this property to require upper-case letters in passwords.
Require Digits | toggle | No | Enable this property to require numbers in passwords.
Require Special Characters | toggle | No | Enable this property to require special characters in passwords.
Enable Password Expiration | toggle | No | Turn this toggle on to force Incorta authenticated users to change their passwords at a scheduled cadence.
Password validity in days | text box | No | Enter the password validity period in days. This property is available when you turn the Enable Password Expiration toggle on. The valid value is between 1 and 365 inclusive. The default value is 90 days.
Note

The password expiration feature is available starting the 2022.10.0 release and it is disabled by default.

  • The start of the password validity period is the password modification date.
  • After upgrading to 2022.10.0, the password modification date will be the upgrade date.
  • For newly created user accounts, the password modification date will be the day when the user creates a new password after successfully signing in to Incorta for the first time.
  • When a password expires and the user tries to sign in to Incorta, a message appears stating that the password has expired. The user has to enter the current password and create a new one.
  • A User Manager or a Super User can also use the resetPasswords public API endpoint to instantly reset the password of all Incorta authenticated users.

The password policy options related to Incorta authentication remain visible even when you select another authentication type. However, they affect Incorta authenticated users only, such as the Super User and other users for whom you specify Incorta as the authentication method.
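The expiration rules in the note above amount to simple date arithmetic. The following sketch assumes that a password counts as expired once the validity period has fully elapsed; the exact cut-off moment is not stated in this document, and the function names are hypothetical.

```python
from datetime import date, timedelta


def expiry_date(modified_on: date, validity_days: int = 90) -> date:
    """Date on which a password changed on `modified_on` stops being valid."""
    if not 1 <= validity_days <= 365:
        raise ValueError("validity must be between 1 and 365 days inclusive")
    return modified_on + timedelta(days=validity_days)


def is_expired(modified_on: date, today: date, validity_days: int = 90) -> bool:
    # Assumption: the password expires once the full validity period has elapsed.
    return today >= expiry_date(modified_on, validity_days)
```

With the default of 90 days, a password last modified on 2023-01-01 (for example, the upgrade date) would expire on 2023-04-01.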

Single Sign-On

The following table illustrates the configuration properties of the Single Sign-On (SSO) authentication:

Configuration Property | Control | Requires Restart | Description
Authentication Type | drop down list | Yes, if there are changes in the 'server.xml' file | Select SSO to sign in to Incorta using a standard SSO tool. This requires enabling Allow Different Authentication Type for External Tools. For more information, refer to Guides → Configure SSO.
Provider type | drop down list | No | Select your provider type. The options are:
  ●   SAML2
  ●   Okta
  ●   Auth0
  ●   Custom (requires entering the name of the custom provider for a specific customer)
Provider configurations | text box | No | Enter the properties or XML configurations for your selected provider.
Auto provision SSO users | toggle | No | Enable this property to automatically add all SSO users to Incorta after successful authentication.
Auto provisioned SSO users default group | drop down list | No | Select the group to which newly provisioned SSO users are assigned.

Lightweight Directory Access Protocol

The following table illustrates the configuration properties of the Lightweight Directory Access Protocol (LDAP) authentication:

Configuration Property | Control | Requires Restart | Description
Authentication Type | drop down list | No | Select LDAP to sign in to Incorta using the LDAP system user and password.
Base Provider URL | text box | Yes, all services | Enter a URL to access the LDAP server in the following format: ldap://<LDAP_SERVER_HOST>:<LDAP_SERVER_PORT>
Base Distinguished Name | text box | Yes, all services | Enter a name attribute to describe the location where the LDAP search should begin.
System User | text box | Yes, all services | Enter the LDAP System User attribute. If searching your LDAP server does not require authentication, do not set this property.
System User Password | text box | Yes, all services | Enter the password for the LDAP System User.
User Mapping Login | text box | Yes, all services | Enter the attribute in LDAP that can be used to sign in to Incorta Analytics. This is the name of the column that contains a unique identifier for users in LDAP, such as an email or ID.
User Mapping Authentication | text box | Yes, all services | Optionally, enter an attribute to authenticate users. If no attribute is entered, the value entered for User Mapping Login will be used to authenticate users.
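As an illustration of the Base Provider URL format, the sketch below builds and sanity-checks a value before you paste it into the CMC. The helper name and the host are hypothetical; 389 is the standard LDAP port and is used here only as a default.

```python
from urllib.parse import urlsplit


def ldap_base_provider_url(host: str, port: int = 389) -> str:
    """Build a URL in the ldap://<LDAP_SERVER_HOST>:<LDAP_SERVER_PORT> format."""
    if not host:
        raise ValueError("host must not be empty")
    if not 1 <= port <= 65535:
        raise ValueError("port must be between 1 and 65535")
    url = f"ldap://{host}:{port}"
    # Round-trip through the URL parser to catch malformed hosts early.
    parts = urlsplit(url)
    if parts.hostname != host.lower() or parts.port != port:
        raise ValueError(f"not a valid LDAP URL: {url}")
    return url
```

For a hypothetical server, `ldap_base_provider_url("ldap.example.com")` yields `ldap://ldap.example.com:389`.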

Azure Active Directory

The following table illustrates the configuration properties of the Azure Active Directory (AD) authentication:

Configuration Property | Control | Requires Restart | Description
Authentication Type | drop down list | No | Select Azure AD to sign in to Incorta using the Client ID and Client Secret. For more information on how to obtain the Azure AD Client ID and Client Secret, refer to Quickstart: Register an app in the Microsoft identity platform.
Client ID | text box | No | Enter the Application (Client) ID for the App registration defined on the Azure AD.
Client Secret | text box | No | Enter the Client Secret value (application password) that is defined for the Client ID on the Azure AD.

Regional Settings

The following table illustrates the configuration properties of the regional settings:

Configuration Property | Control | Requires Restart | Description
Language | drop down list | No | Select the default language for your tenant using the ISO two-letter codes. For example, fr stands for the French language.
Country | drop down list | No | Select the default country for your tenant.
Text Direction | drop down list | No | Select the text direction for your tenant:
  • Left to right
  • Right to left

Email

The following table illustrates the configuration properties of the tenant email:

Configuration Property | Control | Requires Restart | Description
SMTP Host | text box | No | Enter the server used to send emails. For example, smtp.gmail.com.
Enable SMTP SSL | toggle | No | Enable this property to establish a port for Simple Mail Transfer Protocol Secure (SMTPS) to operate using SSL.
Socket Factory Port | spin box | No | Enter the port number used for connection when using an SMTP Socket Factory.
Email Host Requires Authentication | toggle | No | Enable this property if the email host, such as Gmail, requires authentication before sending an email.
SMTP Server Port | spin box | No | Enter the port number to use for connecting to the email server.
Email Server Protocol | drop down list | No | Select an email protocol:
  • smtp
  • ews
Export Server Timeout in Seconds* | spin box | No | Enter the required server timeout in seconds when sending an HTML dashboard. If a dashboard is not sent due to a server timeout, you can increase this value to resolve the issue.
Sender's Username Auth | toggle | No | Enable this property to use a username for the SMTP authentication instead of the sender's email.
System Email Username | text box | No | Enter the SMTP username. This property is available when Sender's Username Auth is enabled.
System Email Address | text box | No | Enter the email address to use for sending dashboards or notifications.
Person Name* | text box | No | Enter the name to display as the email sender.
System Email Password | text box | No | Optionally, enter the account password or an app password for the email account used for sending dashboards or notifications. Leave this field blank if the email server does not require a password for the sender’s email.
Forgot Password Email Subject* | text box | No | Enter the text to display in the subject line of password recovery emails.
Server URL Protocol* | drop down list | No | Select the server URL protocol:
  • http
  • https
Server Name* | text box | No | Enter the current server name, for example, localhost.
Server Port* | spin box | No | Enter the current server port, for example, 8080.
Enable Sharing Notifications | toggle | No | Enable this property to send email notifications when sharing dashboards.
Notification Email Subject* | text box | No | Enter the subject line of a sharing notification email.
Enable Internal Error Notifications | toggle | No | Enable this property to be notified of internal errors by email.
Internal Error Notification Subject* | text box | No | Enter the subject line of the internal error notification emails.
Note

If you have set up two-step verification for the email account you use to send scheduled dashboards and notifications, you must use an app password instead of the account password for the System Email Password option.

For details about how to create an app password, refer to the documentation of your email service provider. For example, for an Office 365 email account, refer to Manage app passwords for two-step verification, and for a Gmail account, refer to Sign in with app passwords.

Data Loading

The following table illustrates the configuration properties related to data loading:

Configuration Property | Control | Requires Restart | Description
Enable Always Compact | toggle | No | In releases before 2023.1.0, enable this property to trigger automatic compaction during schema load.

Note the following:
  • To compact a preloaded schema, either restart the server, or reload the schema (full or incremental).
  • This property must be enabled if SQLi is enabled.
Starting with 2023.1.0, creating a compacted version of Parquet files during load jobs is no longer an option. The Loader Service will start a compaction job during all load jobs. During loading from staging, the Loader Service initiates a compaction job if it detects issues with the compacted version of an object.
Stop Loading on First Error | toggle | No | Enable this property to stop a schema extraction or loading on the first encountered error.
Pause Scheduled Jobs | toggle | No | Enable this property to pause all the active scheduled jobs, including schema loads, dashboards, and data alerts.

Data Management

The following table illustrates the configuration properties related to data management:

Configuration Property | Control | Requires Restart | Description
Stop Loading on First Error | toggle | No | Enable this property to stop a schema extraction or loading on the first encountered error.
Pause Load Plans | toggle | No | Enable this property to pause all the active load plans.
Pause Dashboard Scheduler | toggle | No | Enable this property to pause all the active scheduled dashboards.
Pause Data Notifications | toggle | No | Enable this property to pause all the active data alerts.
Earliest Valid Year | spin box | No | Enter the earliest year you want to extract from your data sources. Years prior to the entered year will be extracted as NULL. The default value is 0, which allows all years to be extracted.

Integration

The following table illustrates the integration configuration properties:

Configuration Property | Control | Requires Restart | Description
Google Maps API Key | text box | No | Enter the API key for Google Maps.
Apple Maps Developer Team ID | text box | No | Enter the Developer team ID to use for enabling Apple Maps in Incorta’s maps visualizations.
Apple MapKit JS Key ID | text box | No | Enter the Apple MapKit JS key to use for enabling Apple Maps in Incorta’s maps visualizations.
Apple Maps API Key | text box | No | Enter the Apple Maps API key to use for enabling Apple Maps in Incorta’s maps visualizations.
Google Drive Client ID | text box | Yes, all services | Enter the Google Drive Client ID used by the Google Drive connector.
Google Drive Client Secret | text box | Yes, all services | Enter the Google Drive Client Secret used by the Google Drive connector.
Dropbox App Key | text box | Yes, all services | Enter the Dropbox App Key used by the Dropbox connector.
Dropbox App Secret | text box | Yes, all services | Enter the Dropbox App Secret used by the Dropbox connector.
Box Client ID | text box | Yes, all services | Enter the Box Client ID obtained from the Box application.
Box Client Secret | text box | Yes, all services | Enter the Box Client Secret obtained from the Box application.
Mapbox API Key | text box | No | Enter the Mapbox API Key of the Advanced Mapbox.
Incorta Slack Integration | toggle | No | Available starting the 2024.1.0 release. Enable to allow the integration between Incorta and your Slack workspace.
Slack app Client ID | text box | No | Enter your unique Slack app client identification number.
Slack app Client Secret | text box | No | Enter your application client secret number found in your App Credentials section in Slack.
Incorta Slack bot token | text box | No | Enter the bot user OAuth token for your Slack workspace.
Incorta Microsoft Teams Integration | toggle | No | Available starting the 2024.1.0 release. Enable to allow the integration between Incorta and Microsoft Teams.
OAuth 2.0-based authentication for JDBC connection | toggle | No | Available starting the 2024.1.0 release. Enable this feature to allow OAuth 2.0 client applications to authenticate with Incorta JDBC. Note that when this feature is enabled, the use of personal access tokens for JDBC authentication will no longer be possible.

Export

The following table illustrates the configuration properties related to exporting Insights:

Configuration Property | Control | Requires Restart | Description
Export All Data from Listing or Aggregated Tables* (before 2024.1.0: Export All Data) | toggle | No | Enable this property to export or download all data from Listing or Aggregated tables to CSV or Excel files. Enabling this option will override the row limit set in the Download Rows Limit option.
Export All Data from Pivot Tables* (before 2024.1.0: Export All Data (Pivot Tables)) | toggle | No | Enable this property to export or download all data from Pivot tables to Excel files. Enabling this option will override the row limit set in the Download Rows Limit option.
Download Rows Limit* | spin box | No | Enter the maximum number of rows to download or export data from a Listing, Aggregated, or Pivot table to a CSV or Excel file.

This option applies only to exporting from Listing or Aggregated tables to CSV or Excel files when Export All Data from Listing or Aggregated Tables is disabled. Similarly, this option is effective only for exporting from Pivot tables to Excel files when Export All Data from Pivot Tables is disabled.
Add BOM to CSV* | toggle | No | Enable this property to add a Byte Order Mark (BOM) to CSV files. This property must be enabled to export Unicode-encoded files.
Include Prompt Selections in Excel Export* | toggle | No | Enable this property to add the applied prompt filters to the first line in an exported or downloaded Excel file.
Export Folder* | upload | No | Select + Add Folder to enter the destination folder’s name and path on the Incorta host server to use for sending or scheduling a dashboard.
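The interaction between the two Export All Data toggles and Download Rows Limit can be summarized as a small decision function. This is a sketch of the rules as described in the table, with a hypothetical function name and parameter names; it is not how Incorta implements the export.

```python
from typing import Optional


def effective_row_limit(
    table_type: str,
    download_rows_limit: int,
    export_all_listing_aggregated: bool,
    export_all_pivot: bool,
) -> Optional[int]:
    """Return the row limit that applies to an export, or None when all rows are exported."""
    if table_type in ("listing", "aggregated"):
        export_all = export_all_listing_aggregated
    elif table_type == "pivot":
        export_all = export_all_pivot
    else:
        raise ValueError(f"unsupported table type: {table_type}")
    # Enabling the matching Export All Data toggle overrides Download Rows Limit.
    return None if export_all else download_rows_limit
```

For example, with only the Listing or Aggregated toggle enabled and a limit of 1000, a listing table export is unlimited while a pivot table export is still capped at 1000 rows.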

Advanced

The following table illustrates the advanced configuration properties related to materialized views (MVs) and the maximum groups default and limit in insights.

Configuration Property | Control | Requires Restart | Description
Enable Hive Context in MVs* | toggle | No | Enable this property to support Hive context in MVs.
Python Path | text box | No | Enter the Python binary executable to use for Python MVs in the driver and executors.
Insight Max Groups UI Default | spin box | No | Enter the default maximum number of groups that cannot be exceeded in an insight. You can also configure this property for an insight in the Settings panel of the Analyzer. The value in the insight overrides the value configured in the CMC.
Insight Max Groups Limit | spin box | No | Enter the limit for the maximum number of groups that cannot be exceeded in an insight. Entering the values 0 or -1 will set the number of groups to unlimited. This value cannot be overridden.
Insight Max Values in Filter Subquery | spin box | No | Enter the maximum values for an insight in the filter subquery.
Force Reload Columns | toggle | Yes, Analytics Service only | Enable this property to automatically reload the data in memory after a schema load. For example, after an incremental update. If disabled, data will only be reloaded when accessing a dashboard that queries such data, which may impact the performance and cause a delayed response.
Sync in Background* | toggle | Yes, Analytics Service only | Enable this property to synchronize your data in the background during a schema load by reading columns in separate memory on the Analytics service.
Warmup Mode on Analytics Services | drop down list | No | Select an option to load and warm up specific columns first on the Analytics service. This decreases the time to load dashboard queries. The options are:
  ●   None - works best for small deployments with ad-hoc queries.
  ●   Last Used Columns - load the previous state prior to shut down in the Analytics service.
  ●   Business View Columns - load all columns referenced in business schema views on the Analytics service only.
  ●   Most Used Dashboards Columns - load the most used dashboards columns first.
  ●   All - works best for supporting ad-hoc queries, in case no business schemas are in place, and the time between the Analytics service startup and dashboard usage is significant.
Warmup Mode on Loader Services | drop down list | No | Select an option to load and warm up specific columns first on the Loader service. The options are:
  ●   None - works best for small deployments with ad-hoc queries.
  ●   Last Used Columns - load the previous state prior to shut down in the Loader service.
  ●   All - works best for supporting ad-hoc queries.
Turn off/on Global Auto Refresh for Insights | toggle | No | Enable this property to turn on Auto Refresh by default for all insights. You can turn off Auto Refresh to allow the Analyzer Users to turn on Auto Refresh for individual insights.

Tuning

The following table illustrates the tuning configuration properties:

Configuration Property | Control | Requires Restart | Description
Maximum Concurrent Load Jobs | drop down list | Yes, Loader Service only | Select the maximum number of load jobs (up to 10) that the Loader Service can handle at the same time, regardless of the number of physical schemas in each load job. The default value is 5. This property is available starting the 2022.11.0 release.
Schema Pool Size | drop down list | Yes, Loader Service only | Select the number of schemas (up to 10) that can be loaded at a time per tenant. The default value is 5. This option is available in releases before 2022.11.0.
Table Maximum Parallel Chunks* | drop down list | Yes | Select the number of concurrent chunks (between -1 and 10) that you can extract from the source at the same time.
Individual and dashboard filter optimization | toggle | No | Enable this property to improve the performance of filtering data, and accordingly reduce the overall rendering time of insights and dashboards. This feature optimizes filtering data using insight individual filters and when applying runtime security filters and dashboard filters (including dashboard runtime filters, applied filters, dashboard prompts, and filter options). This property doesn’t affect queries with non-regular expressions, such as First Version, Last Version, and Distinct.
Maximum Cached Entries | spin box | Yes, Analytics Service only | Enter the maximum number of insights or MVs to store in the cache. The default value is 2000. Caching an insight larger than 10MB may affect performance.

Incorta uses both the Maximum Cached Entries and the Maximum Cached Memory (%) properties to determine the cache size, and then uses the lower of the two limits.

For example, if you set off-heap memory to 100GB, and Maximum Cached Memory (%) to 1%, the cache size will be 1GB. Therefore, if you set Maximum Cached Entries to 2000 and you reach 1GB with fewer reports, Incorta will stop caching reports at 1GB.
Maximum Cached Memory (%) | spin box | Yes, Analytics Service only | Enter a percentage between 0% and 10% of the total off-heap memory to use as the maximum memory size for cached reports. The default is 5% of the off-heap memory.
Export to CSV/XLSX caching limit (In megabytes) | spin box | Yes, Analytics Service only | The maximum data size of the query result that the Analytics Service caches in memory for insights downloaded, sent to data destinations, sent to a download folder, or shared via email in XLSX and CSV formats. The Analytics Service checks for a cached version before executing the insight query, regardless of the requesting user. Valid values are between 0 and 100 inclusive.

The Analytics Service won’t cache insights in the following cases:
  ●  You set this property to 0.
  ●  The query result size exceeds the limit you define.
  ●  You set the Maximum Cached Memory (%) or Maximum Cached Entries options to 0, or these options reach their maximum limits.
  ●  You download or share a dashboard or tab with multiple insights in XLSX format.

Note: When you download or share a dashboard or tab with multiple insights in CSV format, only the first supported insight is downloaded or shared, and cached if applicable.
Evaluate Session Variables At Login* | toggle | Yes | Enable this property to evaluate all session variables on a separate thread at session login. This allows you to evaluate several session variables used in different insights on a single dashboard.
Query timeout | toggle | No | Enable this property to allow queries to be terminated automatically if they exceed the specified limit to run, releasing the read lock acquired on the related entities and freeing the resources reserved by these queries.
This property affects querying data in the following scenarios:
  ●  Creating an insight
  ●  Rendering a dashboard
  ●  Sending a dashboard via email or to a data destination
  ●  Downloading a dashboard as CSV or Excel
  ●  Previewing a chart as a listing or aggregated table
  ●  Creating or updating an Incorta Analyzer table or view
  ●  Exploring the data of a table or view
  ●  Creating a data notification condition
Query timeout (in minutes) | spin box | No | Specify a time limit after which running queries will be terminated automatically. The default is 10 minutes. This property is available only when Query timeout is enabled.
Note

The Individual and dashboard filter optimization and Query timeout features are available starting with the 2023.4.0 release, while the Export to CSV/XLSX caching limit (In megabytes) feature is available starting with 2023.4.1. Starting with 2023.7.0, the Individual and dashboard filter optimization option is available to the cloud admin user.
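The cache sizing example given for Maximum Cached Entries and Maximum Cached Memory (%) can be worked through in a few lines. This is a sketch of the arithmetic only; the function names are hypothetical and the check simply mirrors the "lower of the two limits" rule described in the table.

```python
def cache_memory_limit_gb(off_heap_gb: float, max_cached_memory_pct: float) -> float:
    """Memory available to the cache: a percentage (0 to 10) of off-heap memory."""
    if not 0 <= max_cached_memory_pct <= 10:
        raise ValueError("Maximum Cached Memory (%) must be between 0 and 10")
    return off_heap_gb * max_cached_memory_pct / 100


def can_cache_more(entries: int, used_gb: float, max_entries: int, limit_gb: float) -> bool:
    # Caching stops as soon as either limit is reached, whichever comes first.
    return entries < max_entries and used_gb < limit_gb


limit = cache_memory_limit_gb(off_heap_gb=100, max_cached_memory_pct=1)
print(limit)  # 1.0 (GB), matching the example in the table
```

With 100GB of off-heap memory and 1%, the memory limit is 1GB, so `can_cache_more(500, 1.0, 2000, limit)` is false even though only 500 of the 2000 allowed entries are cached.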

Incorta Labs

Note

An Incorta Labs feature is experimental and can be either promoted to a product feature or deprecated without notice.

The following table illustrates the configuration properties of Incorta Labs:

Configuration Property | Control | Requires Restart | Description
Start Engine on a background thread* | toggle | No | Enable this property to start the engine on a background thread.
Enable saving query plan | toggle | No | Enable this property to keep the join paths of your insights fixed. If a change in a data model causes an invalid join path, the affected insights will remain the same unless their owners approve updating them.
Incorta-Assist | toggle | No | Enable this property to have a bot assist you by suggesting and creating joins in a matter of minutes.
Enable Insight 'View As' Menu | toggle | No | Enable this property to show the View As menu in all the Dashboard Insight action menus.
Enable Incorta SQL Table | toggle | No | Enable this property to create an Incorta SQL Table. The Incorta SQL Table will use a new SQL Engine that supports all types of joins and provides better SQL coverage. The new SQL engine will query from data loaded in the memory.
Machine Learning support* | toggle | No | Enable this property to have machine learning support.
Enable Visualizations SDK* | toggle | No | Enable this property to use external visualizations from the marketplace. For more information, refer to Incorta Component SDK.
Enable creation of SQL Views | toggle | No | Enable this property to create an Incorta SQL View using SQL. SQL in the Incorta SQL View uses a new SQL Engine that supports all types of joins and provides better SQL coverage. Dashboards can use SQL View to query and analyze data. The output of SQL View is generated during run time.
Enable internal join cache for SQL Engine* | toggle | No | Enable this property to internally join the cache for the SQL Engine. This will improve the SQL performance of SQL View.
Enable Insights over Insights* | toggle | No | Enable this property to create insights over result sets.
Enable Bursting Reports* | toggle | No | Enable this setting to allow Incorta to distribute reports based on the recipients’ context instead of the sender’s context. You can send or schedule a bursting report or dashboard for up to 300 internal users with Incorta accounts and at least view access rights to the dashboard. The report runs once but renders customized data based on the security filters or session variables of each recipient. For more details, refer to Bursting Reports.
Max concurrent chromiums* | text box | Yes, Analytics Service only | Specify the maximum number of concurrent headless Chromium browsers.
Max no. of selections for contains filter | spin box | No | Enter the maximum number of selected values in dashboard prompts for string matching operators (In, Not In, Starts With, Does Not Start With, Contains, Does Not Contain, Ends With). The default is -1, which indicates an unlimited number of selections.
Note: 0 also indicates an unlimited number of selections.
Auto Detect Column Type | toggle | No | Disable this property to turn off auto-detection and updating of columns in a schema table with two or more data sources.
Maximum number of versions per entity | spin box | No | Enter the maximum number of versions to maintain for each entity, such as a schema or dashboard. The default value is 50.
Backup frequency | drop down list | No | Select the cadence to create versions for each entity. This can range from a record of each change to a weekly backup. The options are:
  • Keep all versions
  • 5, 10, 15, or 30 minutes
  • 1, 2, 4, 6, or 12 hours
  • 1 Day
  • 1 Week
Notebook Integration | toggle | No | Enable this property to integrate the notebook. Notebook is not currently supported on Windows environments.
Enable Dynamic Allocation in Notebook | toggle | No | Enable this property to use dynamic allocation by default in all notebooks. Disabling this property will cause notebook sessions to allocate the full resource of the materialized views in their interactive session. Thus, it is recommended to enable this property to avoid consuming Spark cluster resources by the notebook sessions.
Enable Multi-Schema Loading* | toggle | No | Turn this toggle on to be able to schedule load plans that involve loading multiple physical schemas. This option is available starting with 2022.11.0 and is deprecated in 2023.7.0, where multi-schema load plans are a generally available (GA) feature that is always enabled.

External Visualization Tools

The following table illustrates the configuration properties related to the external visualization business tools:

Configuration Property | Control | Requires Restart | Description
Default Schemas* | text box | No | Enter the list of default schemas, separated by commas, to be processed in order when a non-fully qualified table is encountered in an SQL query from an external BI tool.
Incorta Host* | text box | No | Enter the IP address of the Incorta Server (or Proxy or Load Balancer, if applicable) to use for connecting the external BI tools to Incorta.
Incorta Port* | spin box | No | Enter the port number of the Incorta Server (or Proxy or Load Balancer, if applicable) to use for connecting the external BI tools to Incorta.
Excel Addin* | toggle | No | Enable this property to see instructions on how to connect Excel as an external BI tool to Incorta.
MicroStrategy* | toggle | No | Enable this property to see instructions on how to connect MicroStrategy as an external BI tool to Incorta.
Tableau* | toggle | No | Enable this property to see instructions on how to connect Tableau as an external BI tool to Incorta.
PowerBI* | toggle | No | Enable this property to see instructions on how to connect PowerBI as an external BI tool to Incorta.

MV Assistant

Specifying the Spark-related configurations of each MV is a very important step toward improving the MV loading time and performance. However, it is not a simple task to identify the appropriate value combination of these configurations. The 2022.9.0 release introduces the MV Assistant feature to facilitate identifying the appropriate values for each MV. When the feature is enabled, Incorta collects various metrics during each MV load job, analyzes these metrics, and simulates different values to identify and recommend the optimal value combination.

Note

The MV Assistant is a preview feature in the 2022.9.0 release.

The MV Assistant mainly consists of a Spark Listener, a Heuristics Recommender, and a Cleanup job.

  • The Spark Listener collects the required metrics while running the Spark application of an MV load job, simulates different values of the Spark configurations, and saves the output to a file in the following path: <TENANT_NAME>/mvlenslogs/pending/<SCHEMA_NAME>/<TABLE_NAME>/.
  • The Heuristics Recommender uses the output file created by the Spark Listener, applies several heuristics to find the optimal set of values, and saves them to the metadata database. After the output file is used, it is moved to the <TENANT_NAME>/mvlenslogs/archived/<SCHEMA_NAME>/<TABLE_NAME>/ directory.
  • The Cleanup Job, when enabled, deletes the archived files that the Heuristics Recommender has already used after a specific number of days or after reaching the maximum number of archived files to keep. For details, see MV Assistant Properties.
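
To make the directory layout above concrete, the following Python sketch builds the pending and archived locations for a given MV. It only constructs path strings. The function names, the tenant, schema, and table values, and the file name `run_001.json` are hypothetical and chosen for illustration; the actual file names and formats are not documented here.

```python
from pathlib import PurePosixPath


def mv_assistant_dirs(tenant: str, schema: str, table: str) -> dict:
    """Build the pending and archived directories described above."""
    base = PurePosixPath(tenant) / "mvlenslogs"
    return {
        "pending": base / "pending" / schema / table,
        "archived": base / "archived" / schema / table,
    }


def archive_target(tenant: str, schema: str, table: str,
                   file_name: str) -> PurePosixPath:
    """Where a pending output file would land after the recommender uses it."""
    return mv_assistant_dirs(tenant, schema, table)["archived"] / file_name


# Hypothetical tenant, schema, MV, and file name.
dirs = mv_assistant_dirs("demo", "Sales", "orders_mv")
target = archive_target("demo", "Sales", "orders_mv", "run_001.json")
print(dirs["pending"])
print(target)
```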

The following are the configurations that the MV Assistant analyzes and recommends new values for.

  • Executor instances (spark.executor.instances): The total number of executors to allocate for the application
  • Executor cores (spark.executor.cores): The number of cores per executor
  • Executor memory (spark.executor.memory): The amount of memory to allocate for each executor
  • Shuffle partitions (spark.sql.shuffle.partitions): The number of partitions to split the data into after shuffling
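
As a rough illustration of how these four properties fit together, the following Python sketch holds them in a dictionary, computes the total number of cores they imply, and renders them as `--conf key=value` arguments in the form that `spark-submit` accepts. The values are arbitrary placeholders, not recommendations, and the helper names are hypothetical. How Incorta passes these properties to Spark is not covered by this sketch.

```python
# Illustrative values only; real values depend on the MV and the cluster.
mv_spark_conf = {
    "spark.executor.instances": "4",
    "spark.executor.cores": "2",
    "spark.executor.memory": "8g",
    "spark.sql.shuffle.partitions": "64",
}


def total_cores(conf: dict) -> int:
    """Total cores requested: executors multiplied by cores per executor."""
    return int(conf["spark.executor.instances"]) * int(conf["spark.executor.cores"])


def to_submit_args(conf: dict) -> list:
    """Render the properties as a flat list of --conf key=value arguments."""
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


cores = total_cores(mv_spark_conf)
submit_args = to_submit_args(mv_spark_conf)
print(cores)
print(" ".join(submit_args))
```

In this example, four executors with two cores each request eight cores in total, which is the kind of figure that the Maximum number of total cores property constrains.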

Recommended values are saved to the metadata database per MV and are presented in a dedicated dashboard. Schema developers can manually apply the recommended values per MV.

Note

Contact Incorta Support to get the required assets.

Feature limitations

  • The MV Assistant runs on the level of a single MV. Only one instance of the MV Assistant can run per tenant at a time.
  • Recommended values are not applied automatically to the related MV or to other MVs. You have to set them manually.
  • This feature aims at improving the MV loading time and performance regardless of cost and resources; however, the recommended values are limited to the defined configurations.
  • The MV Assistant does not consider the cluster size.
  • The MV Assistant does not currently support the dynamic resource allocation option, whether set at the MV level or system-wide. It is intended for customers who manage MV resource allocation through default configurations with overrides at the individual MV level.
  • The recommended values are for both full and incremental load jobs.

Notes

  • The MV Assistant requires at least one successful load of an MV before it can provide recommendations for it.
  • The MV Assistant makes recommendations only when there is a chance of improving the loading time and performance; otherwise, it makes no recommendations.
  • Only MVs with recommendations are recorded in the database and represented in the dashboard.
  • The improvement percentage and value are approximate.
  • The MV Assistant consumes a small amount of system resources during each MV job submission.

MV Assistant Properties

The following table illustrates the available properties of the MV Assistant.

Configuration Property | Control | Requires Restart | Description
MV Assistant | toggle | Yes, Loader Service only | Enable or disable the MV Assistant.
Maximum amount of memory per core | spin box | Yes, Loader Service only | The maximum amount of memory, in GB, per core. This value determines the maximum amount of memory that can be assigned to each executor. The accepted value is between 1 and 20.
Maximum number of total cores | spin box | Yes, Loader Service only | The maximum total number of cores the MV Assistant can recommend per Spark application. The accepted value is between 5 and 1000.
MV Assistant Interval (minutes) | spin box | Yes, Loader Service only | Specify the time interval, in minutes, at which the recommender component of the MV Assistant runs.
Enable runs cleanup | toggle | Yes, Loader Service only | Enable or disable the cleaner job, which deletes archived files created by the MV Assistant.
Cleanup interval (days) | spin box | Yes, Loader Service only | Specify the time interval, in days, at which the cleaner job runs.
Retention period for runs (days) | spin box | Yes, Loader Service only | Specify the number of days to keep the files created by the MV Assistant during an archived run. This option, along with the Maximum number of archived runs per MV option, determines which files to keep and which to delete, whichever option is satisfied first.
Maximum number of archived runs per MV | spin box | Yes, Loader Service only | Specify the maximum number of archived run files to keep. This option, along with the Retention period for runs (days) option, determines which files to keep and which to delete, whichever option is satisfied first.
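
The interaction between the two retention options can be sketched as follows. This Python example assumes one plausible reading of "whichever option is satisfied first": an archived run is deleted if it is older than the retention period, or if it falls outside the newest N runs allowed per MV. The function name, the run names, and the dates are hypothetical; the actual cleaner job may apply the rules differently.

```python
from datetime import datetime, timedelta
from typing import List, Tuple


def select_runs_to_delete(runs: List[Tuple[str, datetime]], now: datetime,
                          retention_days: int,
                          max_archived_runs: int) -> List[str]:
    """Return the names of archived runs for one MV that would be deleted.

    Assumed rule: delete a run if it is older than the retention period
    OR if it is not among the newest max_archived_runs runs.
    """
    cutoff = now - timedelta(days=retention_days)
    newest_first = sorted(runs, key=lambda run: run[1], reverse=True)
    to_delete = []
    for position, (name, archived_at) in enumerate(newest_first):
        too_old = archived_at < cutoff
        over_limit = position >= max_archived_runs
        if too_old or over_limit:
            to_delete.append(name)
    return to_delete


# Hypothetical archived runs for a single MV.
now = datetime(2024, 1, 31)
runs = [
    ("run_a", datetime(2024, 1, 30)),
    ("run_b", datetime(2024, 1, 25)),
    ("run_c", datetime(2024, 1, 10)),
    ("run_d", datetime(2023, 12, 1)),
]
deleted = select_runs_to_delete(runs, now, retention_days=30, max_archived_runs=2)
print(deleted)
```

With a 30-day retention period and a limit of two archived runs, `run_d` is removed because of its age and `run_c` because it exceeds the count limit, even though it is still within the retention period.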