Veritas Data Insight 6.1 Installation Guide
- Understanding the Veritas Data Insight architecture
- About the Collector worker node
- About Veritas Data Insight installation tiers
- Preinstallation
- Installing Veritas Data Insight
- Upgrading Veritas Data Insight
- Post-installation configuration
- Installing Windows File Server agent
- Getting started with Data Insight
- Uninstalling Veritas Data Insight
- Appendix A. Installing Data Insight using response files
About the Scanner
The Scanner is a Data Insight process that scans enterprise data repositories by mounting CIFS and NFS network shares or accessing SharePoint servers using the Data Insight web service. The Scanner captures the file or folder hierarchy of a shares, site collections or equivalent data sources and helps you collect in-depth information about files and folders.
Note that the Scanner is a scheduled process. Schedule of the scan can be controlled at the worker node level, filer/web application level, or the shares, site collections or equivalent data sources level. For detailed information on administration topics (including how to schedule scanning) see the Veritas Data Insight Administrator's Guide.
Depending on how the scans are scheduled, the Scanner stores the collected data in separate database files, with appropriate timestamp. For each subsequent incremental scan, the Scanner only scans the files that are added or modified since the last full scan. In case of a full scan, the Scanner scans the data source hierarchy again. These files are eventually uploaded to the Indexer node using the Communication Service.
See About the Indexer worker node.
The Scanner captures information about the following attributes for each file or directory:
The size of a file.
The access time.
The creation time.
The modification time.
The Security ID of the file owner (SID).
The Access Control Lists (ACLs).
Note:
Permission information is not fetched for Documentum, SharePoint Online, and OneDrive data sources.
The details the Scanner captures helps in the computation of metadata-based data ownership.