Enterprise Vault™ Classification using the Veritas Information Classifier

Last Published:
Product(s): Enterprise Vault (15.0)
  1. About this guide
    1. Introducing this guide
      1.  
        Relationship between the Veritas Information Classifier and other classification methods
    2.  
      What's in this guide
    3. Where to get more information about Enterprise Vault
      1.  
        Enterprise Vault training modules
  2. Preparing Enterprise Vault for classification
    1.  
      About the preparatory steps
    2.  
      What you need
    3.  
      Checking the cache location on the Enterprise Vault storage servers
    4.  
      Setting up the Data Access account
    5.  
      Enabling the Veritas Information Classifier on all Enterprise Vault servers
    6.  
      Configuring the Veritas Information Classifier for secure client connections
  3. Setting up Veritas Information Classifier policies
    1.  
      Introducing Veritas
    2.  
      Opening the Veritas Information Classifier
    3.  
      Finding your way around
    4.  
      Analyzing sample content for policy matches
    5. About policies
      1.  
        Creating policies
      2.  
        About policy conditions
      3.  
        Enabling or disabling policies
      4.  
        Exporting or importing policies
      5.  
        Resetting policies
      6.  
        Deleting policies
    6. About patterns
      1.  
        Creating or editing patterns
      2.  
        Exporting or importing patterns
      3.  
        Deleting patterns
    7. About tags
      1.  
        Creating or editing tags
      2.  
        Exporting or importing tags
      3.  
        About the Enterprise Vault index properties
      4.  
        How classification property values and retention categories interact
      5.  
        Points to note on setting retention categories
      6.  
        Deleting tags
    8. About sentiment analysis
      1.  
        About sentiment conditions
      2.  
        Enforcing sentiment analysis at a site level
  4. Defining and applying Enterprise Vault classification policies
    1.  
      About Enterprise Vault classification policies
    2. Defining classification policies
      1.  
        Configuring classification policies to assign retention categories with the shortest duration
    3.  
      About the PowerShell cmdlets for working with classification policies
    4.  
      Associating classification policies with retention plans
    5.  
      About the PowerShell cmdlets for working with retention plans
    6.  
      Applying retention plans to your Enterprise Vault archives
  5. Running classification in test mode
    1.  
      About classification test mode
    2.  
      Implementing classification test mode
    3.  
      About the PowerShell cmdlets for running classification in test mode
    4.  
      Understanding the classification test mode reports
  6. Using classification with smart partitions
    1.  
      About smart partitions
    2.  
      How Enterprise Vault determines whether to archive an item to a smart partition
    3.  
      Setting up smart partitions
    4.  
      Verifying that Enterprise Vault has archived items to smart partitions
  7. Appendix A. Enterprise Vault properties for use in custom field searches
    1.  
      About the Enterprise Vault properties
    2.  
      System properties
    3.  
      Attachment properties
    4.  
      Custom Enterprise Vault properties
    5.  
      Custom Enterprise Vault properties for File System Archiving items
    6.  
      Custom Enterprise Vault properties for SharePoint items
    7.  
      Custom Enterprise Vault properties for Compliance Accelerator-processed items
    8.  
      Custom properties for use by policy management software
    9.  
      Custom properties for Enterprise Vault SMTP Archiving
  8. Appendix B. PowerShell cmdlets for use with classification
    1.  
      About the classification cmdlets
    2.  
      Disable-EVClassification
    3.  
      Get-EVClassificationPolicy
    4.  
      Get-EVClassificationStatus
    5.  
      Get-EVClassificationTestMode
    6.  
      Get-EVClassificationVICTags
    7.  
      Initialize-EVClassificationVIC
    8.  
      Set-EVClassificationVICFIPSMode
    9.  
      New-EVClassificationPolicy
    10.  
      Remove-EVClassificationPolicy
    11.  
      Set-EVClassificationPolicy
    12.  
      Set-EVClassificationTestMode
  9. Appendix C. Classification cache folder
    1.  
      How Enterprise Vault caches the items that it submits for classification
    2.  
      Limits on the size of classification files
    3.  
      Configuring Enterprise Vault to keep the classification files in the cache folder
  10. Appendix D. Migrating from FCI classification to the Veritas Information Classifier
    1.  
      Converting FCI classification rules for use with the Veritas Information Classifier
  11. Appendix E. Monitoring and troubleshooting
    1.  
      Auditing
    2.  
      Checking the classification performance counters
    3.  
      Troubleshooting classification
    4.  
      Searching archives for items that the Veritas Information Classifier has classified
    5.  
      Troubleshooting language detection

Creating policies

The Veritas comes with a large number of built-in policies, but you can create custom policies if the built-in ones do not meet your needs.

You can also edit existing policies. However, in the case of the built-in policies, the changes that you can make are quite limited.

To create a new policy

  1. In the left navigation pane, click Policies.
  2. Click New.

    The New Policy dialog box appears.

  3. Specify the following details:

    Name

    Specifies the policy name. The name must be unique, and it can contain up to 100 alphanumeric, space, and special characters.

    Status

    Enables or disables the policy. You must enable the policy if you want the Veritas to check for and tag the items that match the policy.

    Description

    (Optional.) Provides a short description of the policy for display in the Veritas .

    Risk weight

    Specify the risk weight for the policy. This is a mandatory field.

    By default, the risk weight value of all the custom policies and most of the built-in policies is configured as 1. Users can modify the risk weight value in the range of 0 to 10.

    Tags

    Nominates one or more tags that you want to apply to the items that match the policy conditions. Click the Tags field to choose from a list of the available tags.

    Conditions

    Specifies one or more conditions that an item must meet for the Veritas to consider it a match.

    Click + Condition to add a new condition for this policy.

    Click + Group to add a new group of conditions for this policy.

    See About policy conditions.

  4. To test the policy that you are creating, under Test, click Browse and then select an item that ought to match it.

    Note:

    This test facility can help to confirm that the policy works as you expect. However, we recommend that you run the PowerShell cmdlet Get-EVClassificationVICTags against one or more test items to make certain that this is the case.

    • To test the same document again, click the Refresh icon.

    • To perform sentiment analysis on the selected item (to determine whether the sentiment associated with the item is positive, negative, or neutral), select the Perform sentiment analysis check box. If this check box is selected, a sentiment score tag is displayed to show the sentiment analysis score. The sentiment analysis score details are displayed if the policy has a condition associated with the sentiment, and the item fulfills this criteria.

    • To extract information from images and perform classification using Optical Character Recognition (OCR), select the Include text in images check box. It extracts the English language text.

      Note:

      The Include text in images check box is displayed only when the Tesseract software is installed on the system where Veritas is running.

      Limitations

      Optical Character Recognition processing rate for PDF files is considerably slow.

      In addition to this, OCR will not extract information from:

      • images that are rotated for 45 degree.

      • a single tiff or gif file having multiple pages.

      • handwritten image.

      • passport image.

    Refer to the test functionality results of the sample file for better understanding.

    After finding a match, do the following:

    • Click Show details to see the matching text and confidence levels as shown in the following sample image.

    • Click More details to see the risk level and risk score of the document as shown in the following sample image.

  5. After testing the policy, click Save.