Veritas Data Insight Classification Guide
Configuring classification
You can enable classification from the Classification Configuration page. The settings let you determine the type of classification that Data Insight must perform and specify a limit on the size of files that can be classified.
Note:
If you enable classification, Data Insight automatically adds a rule to exclude the audit events generated by the saved credentials used for content scanning. The exclude rule is added to prevent the accesses by the named credentials from being registered and to ensure that the access time of a file is not modified.
Ensure that all prerequisites are met before you configure classification.
To configure classification
- In the Management Console, click Settings > Configuration under Classification.
- On the Classification Configuration page, edit all or any of the following settings:
Enable classification
When the check box is selected, you can submit files for classification from the Workspace tab, Reports tab, and Settings tab > Classification > Requests.
See Initiating classification.
Enable Smart Classification
The check box is selected by default. Clear the check box to disable Smart Classification.
When enabled, Data Insight intelligently analyzes the files to identify sensitive files and submits them for classification.
Enable Optical Character Recognition for classification of images
When this check box is selected, you can classify images.
To allow classification of images, the Tesseract software is installed on the Management Server, Collector, and Classification server nodes during the installation of Data Insight. The default location for the Tesseract installation is C:\Program Files (x86)\Tesseract-OCR. If the Tesseract installation fails, refer to the Troubleshooting section in the Veritas Data Insight Administrator's Guide to manually install the Tesseract software.
Note:
Optical character recognition (OCR) is a performance-intensive feature. Veritas recommends that you select only those file groups that contain the specific file extensions that you need to classify for OCR. By default, the file group Images for OCR is selected.
Skip classification of files with size greater than
You can set a limit on the size of files that Data Insight submits for classification. Data Insight does not submit files that exceed the specified size.
- Click Save.
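Before you rely on the OCR setting described in this procedure, it can help to confirm that Tesseract is present on a node. The following Python sketch is not part of Data Insight. It only checks the default installation folder named in this guide and, if an executable is found there, asks it for its version. The executable name tesseract.exe and the helper names find_tesseract and tesseract_version are assumptions made for illustration.

```python
import subprocess
from pathlib import Path
from typing import Optional

# Default install location quoted in this guide; change it if you installed elsewhere.
DEFAULT_TESSERACT_DIR = Path(r"C:\Program Files (x86)\Tesseract-OCR")


def find_tesseract(install_dir: Path = DEFAULT_TESSERACT_DIR) -> Optional[Path]:
    """Return the path to tesseract.exe under install_dir, or None if it is missing.

    The file name 'tesseract.exe' is an assumption, not taken from this guide.
    """
    exe = install_dir / "tesseract.exe"
    return exe if exe.is_file() else None


def tesseract_version(exe: Path) -> str:
    """Run `tesseract --version` and return the first line that it prints."""
    result = subprocess.run(
        [str(exe), "--version"], capture_output=True, text=True, check=True
    )
    # Some Tesseract builds print the version to stderr instead of stdout.
    output = result.stdout or result.stderr
    lines = output.splitlines()
    return lines[0] if lines else ""


if __name__ == "__main__":
    exe = find_tesseract()
    if exe is None:
        print(f"Tesseract not found under {DEFAULT_TESSERACT_DIR}")
    else:
        print(f"Found {exe}: {tesseract_version(exe)}")
```

Run the script on each Management Server, Collector, and Classification server node. If it reports that Tesseract is not found, follow the manual installation steps in the Administrator's Guide.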
See Configuring safeguard settings for Classification Server.
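To estimate the effect of the Skip classification of files with size greater than setting before you change it, you can preview which files in a folder would fall above a candidate limit. The following Python sketch is an illustration only and does not reflect how Data Insight implements the check. The function name split_by_size_limit is hypothetical, and the limit is expressed in bytes as an assumption because this guide does not state the unit.

```python
from pathlib import Path
from typing import Iterable, List, Tuple


def split_by_size_limit(
    paths: Iterable[Path], max_bytes: int
) -> Tuple[List[Path], List[Path]]:
    """Split files into (would_submit, would_skip) for a given size limit.

    A file is skipped only when its size is strictly greater than max_bytes,
    mirroring the wording "size greater than" in the setting name.
    """
    would_submit: List[Path] = []
    would_skip: List[Path] = []
    for path in paths:
        if path.stat().st_size > max_bytes:
            would_skip.append(path)
        else:
            would_submit.append(path)
    return would_submit, would_skip


if __name__ == "__main__":
    # Hypothetical example: preview a 50 MB limit on the current folder.
    limit = 50 * 1024 * 1024
    files = [p for p in Path(".").iterdir() if p.is_file()]
    submit, skip = split_by_size_limit(files, limit)
    print(f"{len(submit)} file(s) within the limit, {len(skip)} file(s) above it")
```

A file whose size equals the limit is treated as within the limit in this sketch.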