Veritas Data Insight Classification Guide

Product(s): Data Insight (6.1.6)

Configuring classification

You can enable classification from Settings > Configuration, under Classification. The settings let you choose the type of classification that Data Insight performs and set a limit on the size of files that can be classified.

Note:

If you enable classification, Data Insight automatically adds a rule to exclude the audit events generated by the saved credentials used for content scanning. The exclude rule prevents accesses by these credentials from being registered and ensures that the access time of a file is not modified.

Ensure that all prerequisites are met before you configure classification.

To configure classification

  1. In the Management Console, click Settings > Configuration under Classification.
  2. On the Classification Configuration page, edit any or all of the following settings:

    Enable classification

    When the check box is selected, you can submit files for classification from the Workspace tab, Reports tab, and Settings tab > Classification > Requests.

    See Initiating classification.

    Enable Smart Classification

    The check box is selected by default. Clear the check box to disable Smart Classification.

    When enabled, Data Insight intelligently analyzes the files to identify sensitive files and submits them for classification.

    See About Smart Classification.

    Enable Optical Character Recognition for classification of images

    When this check box is selected, you can classify images.

    To allow classification of images, the Tesseract software is installed on the Management Server, Collector, and Classification Server nodes during the installation of Data Insight. The default location for the Tesseract installation is C:\Program Files (x86)\Tesseract-OCR. If the Tesseract installation fails, refer to the Troubleshooting section in the Veritas Data Insight Administrator's Guide to install Tesseract manually.

    Note:

    Optical character recognition (OCR) is a resource-intensive feature. Veritas recommends that you select only those file groups that contain the file extensions that you need to classify using OCR. By default, the file group Images for OCR is selected.

    Skip classification of files with size greater than

    You can limit the size of files that Data Insight submits for classification. Data Insight does not submit files that exceed the specified size.

  3. Click Save.
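
Before you enable Optical Character Recognition, it can help to confirm that Tesseract is present on a node. The following Python sketch is illustrative only and is not part of Data Insight. It checks the default installation location given above; the helper name tesseract_installed is hypothetical, and the executable name tesseract.exe is an assumption based on standard Windows builds of Tesseract.

```python
from pathlib import Path

# Default install location stated in this guide; change it if Tesseract
# was installed elsewhere on the node.
DEFAULT_TESSERACT_DIR = Path(r"C:\Program Files (x86)\Tesseract-OCR")


def tesseract_installed(install_dir=DEFAULT_TESSERACT_DIR):
    """Return True if a tesseract.exe file exists in install_dir."""
    # Assumption: the executable is named tesseract.exe, as in standard
    # Windows builds of Tesseract. Data Insight may lay out files differently.
    return (Path(install_dir) / "tesseract.exe").is_file()


if __name__ == "__main__":
    if tesseract_installed():
        print("Tesseract found in", DEFAULT_TESSERACT_DIR)
    else:
        print("Tesseract not found; see the Troubleshooting section "
              "of the Administrator's Guide.")
```

Run the script on each Management Server, Collector, and Classification Server node where you expect images to be classified.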

See Configuring safeguard settings for Classification Server.

See Initiating classification.

See Viewing classification status.
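
To estimate the effect of the file size limit before you set it, you can check which files in a sample would be skipped. The following Python sketch is a hypothetical illustration of the "greater than" comparison described for that setting; it is not Data Insight code. The function name split_by_size_limit and the use of bytes as the unit are assumptions, and this guide does not state how Data Insight measures file size internally.

```python
import os


def split_by_size_limit(paths, limit_bytes):
    """Partition paths into (submitted, skipped) using a size limit in bytes."""
    submitted, skipped = [], []
    for path in paths:
        # Files strictly larger than the limit are skipped; files at or
        # below the limit are submitted for classification.
        if os.path.getsize(path) > limit_bytes:
            skipped.append(path)
        else:
            submitted.append(path)
    return submitted, skipped
```

For example, calling split_by_size_limit(sample_paths, 50 * 1024 * 1024) on a list of sample file paths shows which of them exceed 50 MB and would not be submitted under such a limit.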