Veritas Data Insight Administrator's Guide

Last Published:
Product(s): Data Insight (7.0)
Platform: Windows
  1. Section I. Getting started
    1. Introduction to Veritas Data Insight administration
      1. About Veritas Data Insight administration
        1.  
          Operation icons on the Management Console
        2.  
          Data Insight administration tasks
    2. Configuring Data Insight global settings
      1.  
        About Data Insight licensing
      2.  
        SQLite WAL mode
      3.  
        Configuring SMTP server settings
      4. About scanning and event monitoring
        1. Configuring scanning and event monitoring
          1.  
            Considerations for running a parallel scan
      5.  
        Monitoring Indexer Node Storage Utilization
      6. About filtering certain accounts, IP addresses, and paths
        1.  
          About exclude rules for access events
        2.  
          About exclude rules for Scanner
        3. Adding exclude rules to Data Insight
          1.  
            Add/Edit Exclude rule for access events options
          2.  
            Add/Edit Exclude rule for Scanner options
      7. About archiving data
        1.  
          About purging data
        2.  
          Configuring data retention settings
        3.  
          Parameterized Purging of access data using Data Retention
        4.  
          Purging indexes by date/whitespace for deleted files
      8. About Data Insight integration with Symantec Data Loss Prevention (DLP)
        1.  
          About configuring Data Insight to integrate with Data Loss Prevention (DLP)
        2.  
          Configuring Symantec Data Loss Prevention settings
        3.  
          Importing SSL certificate from the DLP Enforce Server to Data Insight Management Server
        4.  
          About Symantec Data Loss Prevention (DLP) integration with Data Insight
      9.  
        Importing sensitive files information through CSV
      10. Configuring advanced analytics
        1.  
          Choosing custom attributes for advanced analytics
      11. About open shares
        1.  
          Configuring an open share policy
      12. About user risk score
        1.  
          User risk weight configuration
      13.  
        Configuring file groups
      14.  
        Configuring Workspace data owner policy
      15.  
        Configuring Management Console settings
      16. About bulk assignment of custodians
        1.  
          Assigning custodians in bulk using a CSV file
        2.  
          Assigning custodians based on data ownership
      17.  
        Configuring Watchlist settings
      18. Configuring Metadata Framework
        1.  
          Using the metadata framework for classification and remediation
      19.  
        Proof of concept
  2. Section II. Configuring Data Insight
    1. Configuring Data Insight product users
      1.  
        About Data Insight users and roles
      2.  
        Reviewing current users and privileges
      3. Adding a user
        1.  
          Add or edit Data Insight user options
      4.  
        Editing users
      5.  
        Deleting users
      6.  
        Configuring authorization for Symantec Data Loss Prevention users
      7.  
        Configuring single sign-on (SSO) using security assertion markup language (SAML)
      8.  
        Disabling single sign-on (SSO)
    2. Configuring Data Insight product servers
      1.  
        About Data Insight product servers
      2.  
        Adding a new Data Insight server
      3.  
        Managing Data Insight product servers
      4.  
        Viewing Data Insight server details
      5. About node templates
        1.  
          Managing node templates
        2.  
          Adding or editing node templates
      6.  
        Adding Portal role to a Data Insight server
      7.  
        Adding Classification Server role to a Data Insight server
      8.  
        Assigning Classification Server to a Collector
      9.  
        Associating a Classification Server pool to a Collector
      10.  
        Viewing in-progress scans
      11.  
        Configuring Data Insight services
      12.  
        Configuring advanced settings
      13.  
        Monitoring Data Insight jobs
      14.  
        Rotating the encryption keys
      15.  
        Viewing Data Insight server statistics
      16. About automated alerts for patches and upgrades
        1.  
          Viewing and installing recommended upgrades and patches
      17.  
        Deploying upgrades and patches remotely
      18.  
        Using the Upload Manager utility
      19.  
        About migrating storage devices across Indexers
      20.  
        Viewing the status of a remote installation
    3. Configuring saved credentials
      1. About saved credentials
        1.  
          Managing saved credentials
      2.  
        Handling changes in account password
      3.  
        Data Insight Hash Utility
    4. Configuring directory service domains
      1.  
        About directory domain scans
      2. Adding a directory service domain to Data Insight
        1.  
          Add/Edit Active Directory options
        2.  
          Add/Edit LDAP domain options
        3.  
          Add/Edit NIS domain options
        4.  
          Add/Edit NIS+ domain options
        5. Add/Edit Azure active directory service
          1.  
            Prerequisites for configuring Azure AD
          2.  
            Registering Data Insight with Microsoft to scan Azure AD
          3.  
            Configuring application without user impersonation for Microsoft 365
          4.  
            Creating an application in the Microsoft Azure portal
      3.  
        Managing directory service domains
      4.  
        Fetching users and groups data from NIS+ scanner
      5.  
        Configuring attributes for advanced analytics
      6.  
        Deleting directory service domains
      7.  
        Scheduling scans
      8.  
        Configuring business unit mappings
      9.  
        Importing additional attributes for users and user groups
    5. Configuring containers
      1.  
        About containers
      2. Adding containers
        1.  
          Add new container/Edit container options
      3.  
        Managing containers
  3. Section III. Configuring native file systems in Data Insight
    1. Configuring NetApp 7-mode file server monitoring
      1.  
        About configuring NetApp file server monitoring
      2.  
        Prerequisites for configuring NetApp file servers
      3.  
        Credentials required for configuring NetApp filers
      4.  
        Credentials required for configuring NetApp NFS filers
      5.  
        Configuring SMB signing
      6.  
        About FPolicy
      7.  
        Preparing Data Insight for FPolicy
      8.  
        Preparing the NetApp filer for Fpolicy
      9.  
        Preparing the NetApp vfiler for Fpolicy
      10.  
        Configuring NetApp audit settings for performance improvement
      11.  
        Preparing a non-administrator domain user on the NetApp filer for Data Insight
      12.  
        Enabling export of NFS shares on a NetApp file server
      13.  
        Excluding volumes on a NetApp file server
      14.  
        Handling NetApp home directories in Data Insight
    2. Configuring clustered NetApp file server monitoring
      1.  
        About configuring a clustered NetApp file server
      2.  
        About configuring FPolicy in Cluster-Mode
      3.  
        Pre-requisites for configuring clustered NetApp file servers
      4.  
        Credentials required for configuring a clustered NetApp file server
      5.  
        Preparing a non-administrator local user on the clustered NetApp filer
      6.  
        Preparing a non-administrator domain user on a NetApp cluster for Data Insight
      7.  
        Preparing Data Insight for FPolicy in NetApp Cluster-Mode
      8.  
        Preparing the ONTAP cluster for FPolicy
      9. About configuring secure communication between Data Insight and cluster-mode NetApp devices
        1.  
          Generating SSL certificates for NetApp cluster-mode authentication
        2.  
          Preparing the NetApp cluster for SSL authentication
      10.  
        Enabling export of NFS shares on a NetApp Cluster-Mode file server
      11.  
        Enabling SSL support for Cluster Mode NetApp auditing
    3. Configuring EMC Celerra or VNX monitoring
      1. About configuring EMC Celerra or VNX filers
        1.  
          About EMC Common Event Enabler (CEE)
        2.  
          Preparing the EMC filer for CEPA
        3.  
          Preparing Data Insight to receive event notification
      2.  
        Credentials required for configuring EMC Celerra filers
    4. Configuring EMC Isilon monitoring
      1.  
        About configuring EMC Isilon filers
      2.  
        Prerequisites for configuration of Isilon or Unity VSA file server monitoring
      3.  
        Credentials required for configuring an EMC Isilon cluster
      4.  
        Configuring audit settings on EMC Isilon cluster using OneFS GUI console
      5.  
        Configuring audit settings on EMC Isilon cluster using the OneFS CLI
      6.  
        Configuring Isilon audit settings for performance improvement
      7.  
        Preparing Veritas Data Insight to receive event notifications from an EMC Isilon or Unity VSA cluster
      8.  
        Creating a non-administrator user for an EMC Isilon cluster
      9.  
        Utilizing access zone's SmartConnect Zone/Alias mappings
      10.  
        Purging the audit logs in an Isilon filer
    5. Configuring EMC Unity VSA file servers
      1.  
        About configuring Dell EMC Unity storage platform
      2.  
        Credentials required for configuring an EMC Unity VSA file server
      3.  
        Configuring audit settings on EMC Unity cluster using Unisphere VSA Unity console
    6. Configuring Hitachi NAS file server monitoring
      1.  
        About configuring Hitachi NAS
      2.  
        Credentials required for configuring a Hitachi NAS EVS
      3.  
        Creating a domain user on a Hitachi NAS file server for Data Insight
      4.  
        Preparing a Hitachi NAS file server for file system auditing
      5.  
        Advanced configuration parameters for Hitachi NAS
    7. Configuring Windows File Server monitoring
      1.  
        About configuring Windows file server monitoring
      2.  
        Credentials required for configuring Windows File Servers
      3.  
        Using the installcli.exe utility to configure multiple Windows file servers
      4.  
        Upgrading the Windows File Server agent
    8. Configuring Veritas File System (VxFS) file server monitoring
      1.  
        About configuring Veritas File System (VxFS) file servers
      2.  
        Credentials required for configuring Veritas File System (VxFS) servers
      3.  
        Enabling export of UNIX/Linux NFS shares on VxFS filers
    9. Configuring monitoring of a generic device
      1.  
        About configuring a generic device
      2.  
        Credentials required for scanning a generic device
    10. Managing file servers
      1.  
        About configuring filers
      2.  
        Viewing configured filers
      3. Adding filers
        1.  
          Add/Edit NetApp filer options
        2.  
          Add/Edit NetApp cluster file server options
        3.  
          Add/Edit EMC Celerra filer options
        4.  
          Add/Edit EMC Isilon file server options
        5.  
          Add/Edit EMC Unity VSA file server options
        6.  
          Add/Edit Windows File Server options
        7.  
          Add/Edit Veritas File System server options
        8.  
          Add/Edit a generic storage device options
        9.  
          Add/Edit Hitachi NAS file server options
      4.  
        Custom schedule options
      5.  
        Editing filer configuration
      6.  
        Deleting filers
      7.  
        Viewing performance statistics for file servers
      8.  
        About disabled shares
      9. Adding shares
        1.  
          Add New Share/Edit Share options
      10.  
        Managing shares
      11.  
        Editing share configuration
      12.  
        Deleting shares
      13.  
        About configuring a DFS target
      14.  
        Adding a configuration attribute for devices
      15.  
        Configuring a DFS target
      16.  
        About the DFS utility
      17.  
        Running the DFS utility
      18.  
        Importing DFS mapping
    11. Renaming storage devices
      1.  
        About renaming a storage device
      2.  
        Viewing the device rename status
      3.  
        Considerations for renaming a storage device
  4. Section IV. Configuring SharePoint data sources
    1. Configuring monitoring of SharePoint web applications
      1.  
        About SharePoint server monitoring
      2.  
        Credentials required for configuring SharePoint servers
      3.  
        Configuring a web application policy
      4. About the Data Insight web service for SharePoint
        1.  
          Installing the Data Insight web service for SharePoint
      5.  
        Viewing configured SharePoint data sources
      6. Adding web applications
        1.  
          Add/Edit web application options
      7.  
        Editing web applications
      8.  
        Deleting web applications
      9. Adding site collections
        1.  
          Add/Edit site collection options
      10.  
        Managing site collections
      11.  
        Removing a configured web application
    2. Configuring monitoring of SharePoint Online accounts
      1. About SharePoint Online account monitoring
        1.  
          Prerequisites for configuring SharePoint Online account
      2.  
        Configuring user with minimum privileges in Microsoft 365
      3.  
        Creating an application in the Microsoft Azure portal
      4.  
        Configuring application without user impersonation for Microsoft 365
      5.  
        Adding SharePoint Online accounts
      6.  
        Managing a SharePoint Online account
      7. Adding site collections to SharePoint Online accounts
        1.  
          Add/Edit site collection options
      8.  
        Managing site collections
  5. Section V. Configuring cloud data sources
    1. Configuring monitoring of Box accounts
      1.  
        About configuring Box monitoring
      2.  
        Using a co-admin account to monitor Box resources
      3. Configuring monitoring of cloud sources in Data Insight
        1.  
          Add/Edit Box account
      4.  
        Configuring Box cloud resources through proxy server
      5.  
        Data Insight limitations for Box permissions
    2. Configuring OneDrive account monitoring
      1.  
        About configuring OneDrive monitoring
      2.  
        Configuring user with minimum privileges in Microsoft 365
      3.  
        Creating an application in the Microsoft Azure portal
      4.  
        Configuring application without user impersonation for Microsoft 365
      5.  
        Add/Edit OneDrive account
      6. Adding OneDrive cloud accounts
        1.  
          Add/edit OneDrive user accounts
    3. Managing cloud sources
      1.  
        Viewing configured cloud sources
      2.  
        Managing cloud sources
  6. Section VI. Configuring Object Storage Sources
    1. Amazon S3
      1.  
        About Amazon Simple Storage Service (Amazon S3)
      2.  
        Configuring Amazon S3 account monitoring
      3.  
        Configuring Audit Events in AWS
      4.  
        Creating an Athena table
      5.  
        Adding Amazon S3 account
      6.  
        Limitations for Amazon S3 in Data Insight
      7. Managing Amazon S3 data source
        1.  
          Monitored Buckets
        2.  
          Classification
  7. Section VII. Health and monitoring
    1. Using Veritas Data Insight dashboards
      1.  
        Viewing the system health overview
      2.  
        Viewing the scanning overview
      3.  
        Viewing the scan status of storage devices
      4.  
        Viewing the scan history of storage devices
    2. Monitoring Data Insight
      1.  
        Viewing events
      2.  
        About high availability notifications
      3.  
        Monitoring the performance of Data Insight servers
      4.  
        Configuring email notifications
      5.  
        Enabling Windows event logging
      6.  
        Viewing scan errors
  8. Section VIII. Alerts and policies
    1. Configuring policies
      1.  
        About Data Insight policies
      2. Managing policies
        1.  
          Create Data Activity Trigger policy options
        2.  
          Create User Activity Deviation policy options
        3.  
          Create Real-time Permitted User-based Activity Policy options
        4.  
          Create Real-time Restricted User-based Activity Policy options
        5.  
          Create Real-time Sensitive Data Activity policy options
      3.  
        Managing alerts
  9. Section IX. Remediation
    1. Configuring remediation settings
      1. About configuring permission remediation
        1.  
          Managing and configuring permission remediation
        2.  
          Configuring exclusions for permission recommendation
      2.  
        About managing data
      3.  
        About deleting files
      4. About configuring archive options for Enterprise Vault
        1.  
          Adding new Enterprise Vault servers
        2.  
          Managing Enterprise Vault servers
        3.  
          Mapping file server host names
      5. Setting Microsoft Purview Information Protection (MIP) Label
        1.  
          Creating Minimum Privilege Account Role in Compliance Center
      6.  
        Using custom scripts to manage data
      7.  
        Viewing and managing the status of an operation
  10. Section X. Reference
    1. Appendix A.  Data Insight best practices
      1.  
        Understanding Data Insight best practices
    2. Appendix B.  Migrating Data Insight components
      1.  
        Migrating Data Insight components
    3. Appendix C. Backing up and restoring data
      1.  
        Selecting the backup and restore order
      2.  
        Backing up and restoring the Data Insight Management Server
      3.  
        Backing up and restoring the Indexer node
      4.  
        Understanding Data Insight best practices
    4. Appendix D. Data Insight health checks
      1. About Data Insight health checks
        1.  
          Services checks
        2.  
          Deployment details checks
        3.  
          Generic checks
        4.  
          Data Insight Management Server checks
        5.  
          Data Insight Indexer checks
        6.  
          Data Insight Collector checks
        7.  
          Data Insight Windows File Server checks
        8.  
          Data Insight SharePoint checks
        9.  
          Classification server health checks
        10.  
          Data Insight self service portal server health checks
    5. Appendix E. Command File Reference
      1.  
        fg.exe
      2.  
        indexcli.exe
      3.  
        reportcli.exe
      4.  
        scancli.exe
      5.  
        installcli.exe
    6. Appendix F. Data Insight jobs
      1.  
        Scheduled Data Insight jobs
    7. Appendix G. Troubleshooting
      1.  
        About general troubleshooting procedures
      2.  
        About the Health Audit report
      3.  
        Location of Data Insight logs
      4.  
        Downloading Data Insight logs
      5.  
        Migrating the data directory to a new location
      6. Troubleshooting FPolicy issues on NetApp devices
        1.  
          Viewing FPolicy-related errors and warnings
        2.  
          Resolving FPolicy connection issues
      7.  
        Troubleshooting EMC Celera or VNX configuration issues
      8.  
        Troubleshooting EMC Isilon configuration issues
      9.  
        Troubleshooting SharePoint configuration issues
      10.  
        Troubleshooting Hitachi NAS configuration issues
      11.  
        Troubleshooting installation of Tesseract software

Configuring advanced settings

You can edit various settings of the Data Insight servers by clicking Settings > Data Insight Servers > click any server to open Advanced Settings page.

The advanced settings are divided into the following categories:

  • Filesystem Scanner settings - Configures how the server scans file systems. Data Insight performs two types of scans on the configured shares:

    • Full scans

      During a full scan, Data Insight scans the complete share. These scans can run for several hours, if the share is very big. Typically, a full scan should be run once for a newly added share. After the first full scan, you can perform full scans less frequently based on your preference. Ordinarily, you need to run a full scan only to scan those paths which might have been modified while event monitoring was not running for any reason. In all other cases, the incremental scan is sufficient to keep information about the file system metadata up-to-date.

      See Table: File system scanner settings - Full scan settings.

    • Incremental scans

      During an incremental scan, Data Insight re-scans only those paths that have been modified since the last full scan. It does so by monitoring incoming access events to see which paths had a create event or write event on it since the last scan.

      The incremental scan setting are also applicable to Amazon S3 and OneDrive INCR scan.

      See Table: File system scanner settings - Incremental scan settings.

  • Indexer settings - Configures how the indexes are updated with new information. This setting is applicable only for Indexers.

    See Table: Indexer settings.

  • Audit events preprocessor settings - Configures how often raw access events coming from file servers must be processed before they are sent to the Indexer.

    See Table: Audit events preprocessor settings.

  • High availability settings - Configures how this server is monitored.

    Each server periodically monitors its CPU, memory, state of essential services, number of files in its inbox, outbox, and err folders. Events are published if these numbers cross the configured thresholds. Also, each worker node periodically heartbeats with the Management Server. The Management Server publishes events if it does not receive a heartbeat from a node in the configured interval.

    See Table:  High availability settings .

  • FPolicy safeguard settings - Configures the safeguards related to FPolicy communication. You can either choose to use the global settings or customize the settings for a specific Collector node. You can configure settings for FPolicy Cluster-Mode and 7-mode

    See Table: FPolicy safeguard settings.

  • Report settings - Configures settings for reports.

    See Table:  Reports settings.

  • Classification settings - Configures how the Classification Server scans file system contents.

    Table: Classification settings - Content scans for file system

    [[UNRESOLVED XREF: v125450963]]

  • Windows File Server Agent settings - Configures the behavior of the Windows File Server filter driver. This setting is applicable only for the Windows File Server Agent server.

    See Table: Windows File Server agent settings.

  • Veritas File System server (VxFS) settings - Configures how Data Insight scans the VxFS filer.

    See Table: Veritas File System server settings.

  • NFS settings - Configures how Data Insight scans NFS shares. This setting is not applicable for Linux Collector as Linux Collector scan and fetch content using root privileges.

    See Table: NFS settings.

  • SharePoint settings - Configures the duration for which old audit logs are kept on the SharePoint server. Audit logs that are fetched from the SharePoint server are automatically deleted from the Data Insight database. You can disable this feature at the web application level.

  • Troubleshooting settings - Configures settings that aid troubleshooting.

    See Table: Troubleshooting settings.

  • Set custom properties - Configures certain advanced properties of a Data Insight worker node. Using this facility, you can customize certain properties that are not accessible by the normal settings.

    Note:

    Veritas recommends using the custom properties settings under the guidance of Veritas Support.

You can configure the advanced settings per node or save commonly used settings as a templates. See About node templates.

To configure advanced settings

  1. In the Console, click Settings > Data Insight Servers.
  2. Click the server, for which you want to configure the advanced settings.
  3. Click Advanced settings.
  4. Click Edit.
  5. Make necessary configuration changes, and click Save.

See Managing node templates.

Each of the categories for the advanced settings are described in detail below.

Table: File system scanner settings - Full scan settings

Setting

Description

Maximum scans to run in parallel on this server

The Collector can perform multiple full scans in parallel. This setting puts a limit on the total number of full scans that can run in parallel on a Collector. The default value is two threads. Configure more threads, if you want scans to finish faster.

The setting is disabled by default.

Maximum shares per filer to scan in parallel

If multiple shares of a filer can be scanned in parallel, this setting puts a limit on the total number of shares of a filer that you can scan in parallel.

Default scan schedule

Specifies how often full scans must be performed. By default, full scans are scheduled to repeat at 19:00 P.M. on the last Friday of each month.

Select the check box to override this setting at a filer or at a share level.

Pause scanner for specific times

You can configure the hours of the day when scanning should not be allowed. This setting ensures that Data Insight does not scan during peak loads on the filer.

The setting is enabled by default. Scans resume from the point they were at before they were paused.

Pause scanner schedule

Configures when scanning should not be allowed to run. By default, scanning is paused from 7:00 A.M. to 7.00 P.M., Monday to Friday.

You can specify multiple scanner pause schedules for different days of the week. For example, you can choose to pause scanning from 7:00 A.M. to 7:00 P.M. on weekdays and from 7:00 A.M. to 9:00 P.M. on Saturdays and Sundays.

To add a scanning schedule:

  1. Click Add.

  2. On the Pausing schedule pop-up, select the time period and the days on which you want to pause scanning.

  3. Click Save.

You can also edit or delete existing scanning schedules.

Table: File system scanner settings - Incremental scan settings

Setting

Description

Maximum scans to run in parallel on this server

The Collector can perform multiple full scans in parallel. This setting puts a limit on the total number of full scans that can run in parallel on a Collector. The default value is two threads. Configure more threads, if you want scans to finish faster.

The setting is disabled by default.

Maximum shares per filer to scan in parallel

If multiple shares of a filer can be scanned in parallel, this setting puts a limit on total number of shares of a filer that can be scanned in parallel.

The default value is 2.

Default scan schedule

Specifies how often incremental scans must be performed. By default, incremental scans are scheduled to run at 7:00 P.M. each night.

Schedule incremental scans more or less frequently based on how up-to-date you need information in Data Insight to be.

Pause scanner for specific times

You can configure hours of the day when scanning should not be allowed. This setting ensures that Data Insight does not scan during peak loads on the filer.

This setting is enabled by default. Scans resume from the point they were at before they were paused.

Override default incremental scan schedule (Default: 19:00 on last Friday of each month)

Check the box if you want to override the default incremental scan schedule which runs at 19.00 hrs on last Friday of each month.

Note:

This is applicable for Amazon S3 and OneDrive also.

Pause scanner schedule

Configures when scanning should not be allowed to run. By default, scanning is paused from 7:00 A.M. to 7:00 P.M. Monday to Friday.

Table: File system scanner settings - Throttling for NetApp 7-mode and NetApp cluster-mode filer

Setting

Description

Use global settings

This option is selected by default. When this option is selected, Data Insight uses the throttling thresholds defined on the Settings > Global Settings > Scanning and Event Monitoring page.

See Configuring scanning and event monitoring .

Use custom settings

Select Use custom settings to disable scanning or override the global throttling thresholds and define custom values.

Throttle scanning based on latency of the filer

Clear the check box to disable scanning for the filers that the collector is monitoring.

Select to enable throttling of Data Insight scans for NetApp 7-mode and Cluster-Mode file servers. This option is not selected by default.

Data Insight collects latency information from NetApp file servers. It can use this information to throttle scanning, if latency of the file server increases above a certain level. This ensures scanner does not put additional load on the file server during peak load conditions.

You can configure the following parameters to enable throttling for NetApp file servers:

  • Latency threshold - Specify latency in milliseconds, which when crossed, should throttle scanning for the file server

  • Minimum pause - Specify the minimum duration (in milliseconds) for which the scanner should pause between paths when in throttling mode.

  • Back off value - If increased latency is sustained, pause interval will be increased by the Back off value specified (in milliseconds).

  • Maximum pause - Specify the maximum pause interval for the scanner (in seconds). If exceeded, pause interval is no longer incremented by Back off value.

Table: File system scanner settings - Common settings

Setting

Description

Scanner snapshot interval

Scanning a big share can take several hours. The scanner periodically saves information to a disk so that information is visible sooner without waiting for the entire scan to finish.

You can configure how often information is saved to the disk by the scanner. By default, the scanner creates a snapshot of new information every 300 seconds (5 minutes). The minimum value you can set for this parameter is 300.

Table: Indexer settings

Setting

Description

Total indexer threads

The indexer processes incoming scan and access event information for various shares and updates the per-share database. This setting configures how many databases can be updated in parallel. By default 2 threads are configured.

Specify a larger value for bigger setups where indexer is not able to keep up with incoming rate of information. This is indicated when you observe too many files in the inbox of the Indexer worker node. However, you must ensure that the Indexer has adequate CPU and memory when configuring a higher number of indexer threads. You need approximately 1 GB of RAM per indexer thread.

Limit maximum events processed in memory

By default, the indexer processes all new incoming events in memory before saving the events to the disk. If your are falling short of RAM on your Indexer, you can limit the maximum number of events that the indexer processes in memory before it saves them to the disk.

Note:

Specifying a small number makes the indexing very slow.

Reconfirm deleted paths when reconciling full scan information

After Data Insight indexes full scan data, it computes the paths that no longer seem to be present on the file system. Similarly, if Data Insight discovers folders or subsites missing in a SharePoint site, then it computes those subsites or folders as deleted without performing a reconfirmation scan.

Select this check box to have Data Insight re-confirm if those paths are indeed deleted using an incremental scan before removing them from the index.

When the check box is clear, Data Insight readily removes the missing paths from the indexes without carrying out a re-confirmation.

Note:

Re-confirm scan is not supported for site collections.

Note:

Re-confirm scan is not supported for SharePoint Online, Box, OneDrive, and Amazon S3

Indexer schedule

Specify how often an index should be updated with new information. By default, all new data is consumed once every four hours.

Indexer gets better throughput if more information is given to it when indexing. However, if you configure a very high value, new information will not be visible in the Console for a much longer period.

Indexer integrity checking schedule

Data Insight checks the integrity of its databases once a week. If any errors are found in the database, an event is published. You can configure a different schedule if required.

Table: Audit events preprocessor settings

Setting

Description

Audit events preprocessor schedule

Incoming raw audit events from file servers must be pre-processed before sending them to the Indexer. At this stage, collector.exe applies various heuristics to the raw events and also removes transient events.

By default, raw events are processed every 2 hours.

Batch size (MB)

The maximum size of the raw audit event files that a single Collector thread can process.

The default batch size is 2 GB.

Total Collector threads

The Collector can run multiple pre-processors in parallel. This setting configures how many instances can run in parallel.

Table: FPolicy safeguard settings

Setting

Description

Use global settings

This option is selected by default. Use the FPolicy safeguard settings for NetApp 7-mode and Cluster-mode filers as defined on the Settings > Global Settings > Scanning and Event Monitoring page.

See Configuring scanning and event monitoring .

Data Insight collects latency information from NetApp file servers. It can use this information to initiate safeguard mode, if latency of the file server increases above or falls below a certain level. When the safeguard is in effect, Data Insight drops its FPolicy connection to the filer. This ensures event collection does not put additional load on the file server in peak load conditions.

If the latency on the physical file server increases above the configured threshold, Data Insight disconnects from the associated virtual file server. This information is also displayed on the Data Insight System Overview dashboard.

Use custom settings

Select Use custom settings to disable the safeguard settings or to override the global safeguard thresholds and define custom values.

Enable FPolicy safeguard settings

Select one of the following, as appropriate:

  • Select Enable FPolicy 7-Mode Safeguard to initiate safeguard mode for the NetApp file servers.

  • Select Enable FPolicy 7-Mode safeguard for VFilers to initiate safeguard mode for virtual file servers by monitoring latency statistics for the associated physical NetApp filers. The safeguard mode for the virtual filers is initiated only if the details of the physical filer corresponding to the virtual filer are provided while adding the virtual filer.

  • Select Enable FPolicy Cluster Mode Safeguard to initiate safeguard mode for the NetApp cluster mode file servers.

These FPolicy safeguard settings are not selected by default. When these check boxes are cleared, the safeguard settings are not in effect.

Configure the following values:

  • The high and low thresholds for CIFS and NFS.

  • The number of samples to be considered to calculate the average latency.

  • The minimum time to wait before Data Insight reconnects to the filer after a disconnection.

Table: High availability settings

Setting

Description

Ping timeout (in minutes)

If a worker node does not heartbeat in the specified interval, Management server will publish an event to that effect. This setting is only applicable for the Management Server.

Notify when CPU continuously over (percentage)

If CPU used on this server is consistently over the specified percentage, an event is published. (Default value: 90%)

Notify when memory continuously over (percentage)

If Memory used on this server is consistently over the specified percentage, an event is published. (Default value: 80%)

Notify when disk usage over (percentage)

If disk usage, either for the system drive or data drive, is over the specified threshold, an event is published. (Default value: 80%)

Notify when disk free size under (MB)

If the free disk space for the system drive or data drive is over the specified threshold in megabytes, an event is published. (Default value: 500 MB)

Notify when number of files in err folders over

If Data Insight is not able to process an incoming file for some reason, that file is moved to an err folder. Data Insight publishes an event if number of files in the err folder crosses the specified threshold. (Default value: 50)

Notify when number of files in inbox and outbox folder over

If Data Insight is not able to process incoming data fast enough, the number of files in the transient folders, inbox and outbox, goes on building up. Data Insight publishes an event if number of files crosses the configured threshold. (Default value: 5000)

Table: Classification settings - Content scans for file system

Setting

Description

Maximum content scans to run in parallel on this server

The Classification Server can perform multiple content scans in parallel. This setting puts a limit on the total number of scans that can run in parallel on a Classification Server. The default value is two threads. Configure more threads, if you want scans to finish faster.

Ensure that the server has enough resources to run the configured parallel threads.

Maximum classification threads to run in parallel on this server

When content is being classified in parallel, this setting puts a limit on the total number of classification threads that you can run in parallel.

Maximum shares per filer to content scan in parallel

When multiple shares of a filer are scanned in parallel, this setting puts a limit on the total number of shares of a filer that you can scan in parallel.

Pause content scanner for specific times

You can configure the hours of the day when content scanning should not be allowed. This setting ensures that Data Insight does not scan during peak loads on the filer. The setting is enabled by default. Scans resume from the point they were at before they were paused.

Content scanning is paused by default from Monday to Friday from 7:00 A.M. to 7:00 P.M.

To configure the pause schedule, click Edit on the scheduler to configure the days and time on which content scanning should be paused.

You can configure more than one pause schedules. Click Add to add new pause schedule.

Custom Property

Veritas Classification Server master ID

This property represents the master Classification Server ID assigned to slave Classification Server.

Veritas Classification lb. disabled

This property is applicable to Veritas Classification master server only if at least one server is in the pool used to turn load balancing feature on/off. By default, it is set to false. If you want to disable this feature set its value to true.

Matrix.classify.fetch.max_batches

This is the global property used to set maximum number of batches per priority to keep in content fetch queue on every slave. This is used only if load balancing is enabled. The default value is 10.

Note:

In case of upgrade, if you have configured content scans for file systems and SharePoint, only file systems content scan will prevail, even for SharePoint devices.

Table: Reports settings

Setting

Description

Maximum memory when generating report output

Specifies the maximum memory that can be used for generating a report output. By default, it is 1024 MB on a 32 bit machine and 2048 MB on a 64 bit machine

Total threads for generating report output

Configure the number of threads for generating report output (PDF/HTML/CSV) in parallel. Default value is 2.

This setting applies to the Management Server.

Number of threads for a single report run

Configure the number of threads responsible for generating the report output database for a single report. This configuration applies to the Indexer node.

This setting helps you speed up the process of report generation.

For a particular Data Insight server, the thread count applies to all types of reports.

Maximum reports that can run simultaneously

By default, Veritas Data Insight Software Compatibility List executes two reports in parallel. However, you can configure a higher value to run multiple reports in parallel.

Table: Windows File Server agent settings

Setting

Description

Maximum kernel ring buffer size

The Windows File Server filter driver puts events in an in-memory buffer before the DataInsightWinnas service, consumes them. By default, it uses a 100MB buffer. You can use a bigger buffer. Data Insight publishes an event that indicates events are being dropped due to a high incoming rate.

Note:

Buffer is in kernel and is limited on a 32 bit operating system.

Ignore accesses made by Local System account

The Windows File Server filter driver ignores accesses made by processes running with Local System account. This setting ensures that Data Insight can ignore most events originating from the operating system processes or other services like antivirus and backup.

Clear this check box to enable monitoring accesses made by LOCAL SYSTEM account. This is not recommended on a production file server.

Table: Veritas File System server settings

Setting

Description

Flush events on VxFS filer before audit

Set this option to true, if you want to force VxFS to flush its events to disk each time requests for information. This option is useful in Proof-of-Concept (POC) setups and enables you to see events faster in Data Insight

Maximum number of audit threads

This option determines how many filers to fetch audit information from in parallel.

Maximum kernel ring buffer size (Number of records)

The access event records are saved in a log file on the VxFS filer before Data Insight consumes them. By default, 50,000 records can be saved in the log file. You can also specify a larger number. Data Insight publishes an event that indicates that events are being dropped due to a high incoming rate.

Table: NFS settings

Setting

Description

Set default credentials for NFS scanner

Set this option to true if you want to allow Data Insight to use the specified User and Group ID to log in to scan NFS shares.

User ID

The ID of the NFS user that the Data Insight uses to scan the filer.

You can set the value to 0 to allow root access from the Data Insight scan hosts.

Group ID

The ID of the group that the Data Insight uses to scan the filer.

You can set the value to 0 to allow root access from the Data Insight scan hosts.

Table: Troubleshooting settings

Setting

Setting

Preserve intermediate files

As new data comes into a Data Insight system, it moves between various modules. In this process the original files are deleted and a new processed file is generated for the next stage of processing.

To aid troubleshooting, select this check box to retain the intermediate data files. These files get stored in attic folder in the data directory.

Preserve raw audit event files

Events processed by the Audit Pre-processor stage are deleted once consumed. If this setting is enabled, raw audit event files will be preserved in the attic folder in the data directory.