Platform Requirements
Platform Requirements
Two of the Diskover’s deployment components, Elasticsearch and web-server, can be hosted on-premise or in the cloud, and the third component, Diskover file indexer, is typically deployed using the customer’s on-premise resources. Contact us for further technical specifications.
It is recommended to separate the Elasticsearch, web-server and indexing host(s). Indices ideally should be on SSD. NFS data stores do not usually perform well for indices.
📕 Access the Diskover Installation Guide for more information.
Best Practices
Performance, scalability, or recovery issues outside of our recommended best practices of a minimum of 3 Elasticsearch nodes are not guaranteed/supported by Diskover and will incur a support charge of $10,000. Included in the list of problematic issues when buying less than 3 Elasticsearch nodes are, but not limited to, support for multiple geographic locations, high-frequency indexing, a large amount of data, and a large number of file systems.
Architecture Diagrams
Prerequisites
Main Requirements
Other Notes
Elasticsearch Domain
The foundation of the Diskover platform consists of a series of Elasticsearch indexes. These indexes are created and stored within the Elasticsearch endpoint. Elasticsearch is a scale out architecture using 1 to N nodes.
🖱️ Click here for more detailed Elasticsearch and AWS sizing guidelines.
🖱️ Click here for information on resilience in small clusters.
Elasticsearch Cluster
Production Deployments
Proof of Concept
Indices
Rule of Thumb Shard Size
Examples
Estimating Elasticsearch Storage Requirements
Individual Index Size
Rolling Indices
Diskover-Web Server
The Diskover-Web HTML5 user interface requires a Web server platform. A Linux or Windows instance can be configured with applications to provide web serving capabilities is required. The Diskover-Web user interfaces provides visibility, analysis, and actions from the indexes that reside on the Elasticsearch endpoint.
Multiple indexers can be ran on a single machine or multiple machines for parallel crawling.
Linux
Windows
Recommended
Minimum
Diskover Indexer(s)
You can install Diskover indexers on a server or virtual machine (VM). Multiple indexers can be ran on a single machine or multiple machines for parallel crawling.
Linux
Windows
Mac
Recommended
Minimum
AWS Sizing Resource Requirements
🖱️ Click here to access the Diskover AWS Customer Deployment Guide for more requirements information.
Elastisearch Domain
The foundation of the Diskover platform consists of a series of Elasticsearch indexes. These indexes are created and stored within the AWS Elasticsearch endpoint. The recommended AWS nodes are:
Minimum
Recommended
EC2 Web-Server
The Diskover-Web HTML5 user interface requires a web-server platform. An EC2 instance configured with applications to provide web serving capabilities is required. The Diskover-Web user interfaces provides visibility, analysis, and actions from the indexes that reside on the AWS Elasticsearch endpoint. The recommended EC2 instances are:
Minimum
Recommended
Indexer(s)
The recommended instances are:
Minimum
Recommended
Skills and Knowledge Requirements
Although the simplification of the installation and configuration of the Diskover software is in the works, as of now, the installation is intended to be performed by service professionals and system administrators. The installer should have strong familiarity with:
- Operating System on which on premise Diskover file Indexer(s) are installed.
- Basic knowledge of:
- Operating System on which Diskover-Web HTML5 user interface is installed.
- Configuring Web Server (Apache or NGINX).
Important!
⚠️ Attempting to configure Diskover Data Curation platform without proper experience or training can affect system performance and security configuration.
⏱️ The initial installation, configuration, and deployment of Diskover Data is expected to take between 1 to 3 hours depending on time consumed with network connectivity.