Diskover Plugins Ecosystem

Powerful Data Management Tools

Diskover’s open-source infrastructure offers limitless extensibility, enabling the development of plugins by Diskover, third parties, or end users to meet specific needs.

Our data management solution includes a wide range of integrated, configurable tools, with many more in the pipeline. A key feature is File Action, which empowers authorized users to perform actions on files directly from the user interface, enhancing workflow efficiency and control.



Extra Metadata Catalog

This diagram gives an overview of how Diskover empowers all levels of an organization by giving examples of the needs, challenges and solutions for different people with different roles. Diskover allows all stakeholders to have their own relationship with data through different tools and features, facilitated by the global index and access to all data in one single view.

Overview:  Additional attribute collected when Diskover first detects a new file in a given location.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Designed to enhance basic metadata with detailed production status information, aligning data management with production schedules. The Diskover Flow Production Tracking Plugin harvests additional attributes from the Autodesk Flow Production Tracking platform for every shot directory located on storage. These attributes become properties of the shot directories and include status information such as finaled, out-of-picture, multiple project tracking dates, and many more, totaling around one hundred indexable fields. Note that users can opt to only index the fields that are relevant to their business.

Status:  🟠 Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to end-user docs

Overview:  The BAM plugin enables additional metadata collection for BAM (Binary Alignment Map) and SAM (Sequence Alignment Map) file formats. These extra fields become searchable, reportable for analysis, and actionable, allowing for potential upstream file management, manually or via automated scheduled tasks.

Status:  🟠 Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

📓 Refer to technical docs

Overview:  Collects dozens of PowerScale SmartPools attributes that are key for granular analysis, workflow automation, and overall data curation.

Status:  🟠 Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

ℹ️ Learn more

Diskover Scalable Dell PowerScale Indexing Architecture

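Overview:  Adds xxhash, md5, sha1, and sha256 hash values to files in Elasticsearch indices to use for checksums/data integrity. Hash values are like fingerprints: a hash function reduces a file's contents to a short, fixed-length value, and any change to the contents almost certainly changes that value. Two different files are extremely unlikely to share a hash value by accident, although collisions can never be ruled out entirely. MD5, SHA-1, and SHA-256 were designed as cryptographic hash functions, a family that is also used in authentication and digital signatures; xxHash is a non-cryptographic hash built for speed. Hashing is one-way and is not encryption: the original data cannot be recovered from the hash value.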

  • MD5: 128 bits. Security level: low, because practical collision attacks exist. Algorithm complexity: simple. The MD5 message-digest algorithm is a widely used hash function producing a 128-bit hash value. MD5 was designed by Ronald Rivest in 1991 to replace an earlier hash function, MD4, and was specified in 1992 as RFC 1321. MD5 can still be used as a checksum to verify data integrity against unintentional corruption.
  • SHA1: 160 bits. Security level: low to moderate, because collisions have been demonstrated in practice. Algorithm complexity: complex. SHA-1 is a hash function which takes an input and produces a 160-bit hash value known as a message digest, typically rendered as 40 hexadecimal digits. It was designed by the United States National Security Agency, and is a U.S. Federal Information Processing Standard.
  • SHA256: 256 bits. Security level: high. Algorithm complexity: complex. SHA-256 belongs to the SHA-2 family, a set of cryptographic hash functions designed by the United States National Security Agency and first published in 2001. They are built using the Merkle–Damgård construction, from a one-way compression function itself built using the Davies–Meyer structure from a specialized block cipher.
  • XXHASH: 32, 64, or 128 bits, depending on the variant. Security level: none (non-cryptographic); it detects accidental changes but does not resist deliberate tampering. Algorithm complexity: simple. xxHash is an extremely fast hash algorithm, running at RAM speed limits. It successfully completes the SMHasher test suite, which evaluates collision, dispersion, and randomness qualities of hash functions. Code is highly portable, and hashes are identical on all platforms (little / big endian).
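
To make the checksum idea concrete, here is a minimal sketch of computing several hash values for one file in a single pass. It uses only Python's standard hashlib module; the function name file_digests and its parameters are assumptions made for this example and are not taken from the plugin. xxhash is left out because it requires a third-party package.

```python
import hashlib


def file_digests(path, algorithms=("md5", "sha1", "sha256"), chunk_size=1024 * 1024):
    """Return {algorithm: hex digest} for one file, reading the file only once."""
    hashers = {name: hashlib.new(name) for name in algorithms}
    with open(path, "rb") as handle:
        # Stream in chunks so that large files never have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}
```

Feeding every hasher from the same chunk means the file is read from disk once, which matters because disk reads are usually the expensive part of checksumming.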

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
 

📹 Watch this video for a use case example of hash values

Overview:  Adds md5 and sha1 hash values using AWS Lambda/Fixity to files in Elasticsearch indices built using the S3 alt scanner. Hash values are like fingerprints: a hash function reduces a file's contents to a short, fixed-length value, and any change to the contents almost certainly changes that value. Two different files are extremely unlikely to share a hash value by accident, although collisions can never be ruled out entirely. Hashing is one-way and is not encryption: the original data cannot be recovered from the hash value.

  • MD5: 128 bits. Security level: low, because practical collision attacks exist. Algorithm complexity: simple. The MD5 message-digest algorithm is a widely used hash function producing a 128-bit hash value. MD5 was designed by Ronald Rivest in 1991 to replace an earlier hash function, MD4, and was specified in 1992 as RFC 1321. MD5 can still be used as a checksum to verify data integrity against unintentional corruption.
  • SHA1: 160 bits. Security level: low to moderate, because collisions have been demonstrated in practice. Algorithm complexity: complex. SHA-1 is a hash function which takes an input and produces a 160-bit hash value known as a message digest, typically rendered as 40 hexadecimal digits. It was designed by the United States National Security Agency, and is a U.S. Federal Information Processing Standard.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📹 Watch this video for a use case example of hash values

Overview:  Provides the ability to visualize and trend data metrics over time using Grafana. The plugin rolls up summary data into separate, Grafana-specific logstash- indices within Elasticsearch. These time-series indices are keyed on the @timestamp field and hold directory sizes and counts down to N directory levels (default 2). Grafana can then chart these summary indices by using Elasticsearch as a data source.
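
As a rough illustration of the roll-up described above, the following sketch sums file sizes and counts per directory down to a maximum depth and stamps each summary document with an @timestamp value. The function name, the input shape, and every field other than @timestamp are assumptions made for this example, not the plugin's actual schema.

```python
from collections import defaultdict
from datetime import datetime, timezone


def rollup(files, max_depth=2, timestamp=None):
    """Summarize (path, size_bytes) pairs per directory, down to max_depth levels."""
    totals = defaultdict(lambda: {"size": 0, "count": 0})
    for path, size in files:
        parts = path.strip("/").split("/")[:-1]  # directory components only
        # Credit the file to each ancestor directory within the depth limit.
        for depth in range(1, min(len(parts), max_depth) + 1):
            directory = "/" + "/".join(parts[:depth])
            totals[directory]["size"] += size
            totals[directory]["count"] += 1
    stamp = (timestamp or datetime.now(timezone.utc)).isoformat()
    return [
        {"@timestamp": stamp, "path": directory, "depth": directory.count("/"), **summary}
        for directory, summary in sorted(totals.items())
    ]
```

Running the roll-up on a schedule and indexing each batch of documents would give a time series that a dashboard can trend.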

Status:  🟠   Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Adds business context and searchability via additional media file attributes (resolution, codec, pixel format, etc.). The enriched metadata is key for granular analysis, workflow automation, and overall data curation.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to technical docs

📓 Refer to end-user docs

Overview:  Collects additional user-injected S3 attributes that have been added by an organization and puts them in the Diskover index. These extra fields become searchable, reportable for analysis, and actionable, allowing for potential upstream file management, manually or via automated scheduled tasks.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Adds the Unix permission attributes of each file and directory to the Elasticsearch data catalog during indexing. Two tags are added, unixperms-plugin and ugo+rwx, if a file or directory is found with fully open permissions (777 or 666). 
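
The tagging rule above can be sketched in a few lines. This is an illustration only: the helper name permission_tags is hypothetical, and the real plugin may evaluate permissions differently. The two tag names are the ones given in the overview.

```python
import stat

# Fully open permission sets named in the overview above.
OPEN_MODES = {0o777, 0o666}


def permission_tags(st_mode):
    """Return the two tags for a fully open file or directory, else an empty list."""
    perms = stat.S_IMODE(st_mode)  # drop the file-type bits, keep the permission bits
    if perms in OPEN_MODES:
        return ["unixperms-plugin", "ugo+rwx"]
    return []
```

A caller would typically pass os.stat(path).st_mode for each indexed path.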

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media

📓 Refer to technical docs

Overview:  Adds Windows file owner, primary group, and DACL info to files and directories in Elasticsearch indices.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

Overview:  Adds the Windows file owner and primary group of each file and directory to the Diskover index at time of indexing. It replaces all docs showing username 0 with the Windows file/directory owner name. 

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media

📓 Refer to technical docs

Overview:  The Xytech Asset Creation Plugin is designed to find and index newly restored online assets from LTO tapes, removable hard drives, etc., making them available, findable, and searchable within the Xytech Media Operations Platform as well as within Diskover. This plugin adds business context via asset ID attributes once assets have been rehydrated. The enriched metadata is key for granular analysis, workflow automation, and overall data curation.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to technical docs

📓 Refer to end-user docs

Overview:  The Xytech Order Status Plugin is designed to automate the correlation of the order management system and the storage system, by harvesting key business context from Xytech and applying that context within the AJA Diskover Media Edition. In turn, this business context metadata (order phase, invoice date, etc.) can be used to automate workflows, curate data, monitor costs, create highly customized reports, and search granularly.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to end-user docs

Data Curation Plugins

Diagram representing that data is now treated like a physical good and goes through a virtual conveyor belt. Diskover's plugins and other features help the automation of data manufacturing, making it efficient while tracking storage space costs. Workflows are essential to sustainable data management.

Overview:  Designed to move, copy, delete, rename, or run custom commands on files and/or directories based on a set of highly configurable criteria. Any Elasticsearch query (tags, age, size, path, filename, extra metadata collected during post-index, etc.) can be used for the criteria providing very granular actions.
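
To show what such criteria might look like, here is a sketch that builds an Elasticsearch bool query selecting files by extension, size, and age. The bool, term, and range clauses and the date-math string are standard Elasticsearch Query DSL; the field names extension, size, and mtime and the function name are assumptions for this example and may not match the fields in a Diskover index.

```python
def build_criteria(extension, min_size_bytes, older_than_days):
    """Build a query body matching large, old files of one type."""
    return {
        "query": {
            "bool": {
                # Filter clauses must all match and do not affect scoring.
                "filter": [
                    {"term": {"extension": extension}},
                    {"range": {"size": {"gte": min_size_bytes}}},
                    # Date math: last modified at least N days ago.
                    {"range": {"mtime": {"lte": f"now-{older_than_days}d/d"}}},
                ]
            }
        }
    }
```

For instance, build_criteria("mov", 1_000_000_000, 365) would describe .mov files of at least 1 GB that have not been modified for a year, which an action such as move or copy could then target.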

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

Overview:  Diskover can apply tags either 1) at time of index, by adding tag rules in the config file so that tagging happens during the crawl, or 2) post-index, by auto-tagging an existing completed index. Tags are applied automatically via a series of rules applied to directories or files. The rules, based on a combination of regular and business context metadata, can be very powerful.
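
A rule-based tagger can be sketched as follows. The TagRule structure, its fields, and the document keys name and age_days are hypothetical and chosen only to illustrate the idea; Diskover's own rule syntax lives in its config file and is not reproduced here.

```python
import fnmatch
from dataclasses import dataclass


@dataclass
class TagRule:
    tag: str                 # tag to apply when the rule matches
    name_pattern: str = "*"  # shell-style pattern tested against the file name
    min_age_days: int = 0    # minimum age in days for the rule to match


def auto_tags(doc, rules):
    """Return the tags of every rule that matches one file or directory doc."""
    tags = []
    for rule in rules:
        name_ok = fnmatch.fnmatchcase(doc["name"], rule.name_pattern)
        age_ok = doc["age_days"] >= rule.min_age_days
        if name_ok and age_ok:
            tags.append(rule.tag)
    return tags
```

Because each rule is evaluated independently, one document can receive several tags, and the same rule set can run during the crawl or against a completed index.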

Status:  🟢 Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

Overview:  Precisely identifies duplicate files across a single index or all indices by comparing xxhash, md5, sha1, and sha256 hash values. The plugin is designed for multiple use cases:

  • To check for duplicate files across a single or all file systems (single or multiple indices) and mark the file docs that are dupes in the index/indices.
  • To calculate the file checksums/hashes for all duplicate files or all files and index the hashes to file docs in an index.
  • To customize reports that easily locate all duplicate files for clean-up efforts and/or to secure sensitive information.

Calculating file hash checksums is an expensive CPU/disk operation. The dupes finder provides configuration options to control what files in the index get a hash calculated and marked as a dupe. The duplicates plugin harvests hash values that can be stored only for duplicates or for all files.

In addition, the dupes finder provides additional optimization mechanisms as described in the Diskover Configuration and Administration Guide.
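
Because hashing is expensive, a common way to limit the work is to hash only files that could possibly be duplicates. The sketch below groups files by size first and hashes only the groups with more than one member. This size-first strategy and the function name are assumptions for illustration; the dupes finder's actual options are described in the guide mentioned above.

```python
import hashlib
import os
from collections import defaultdict


def find_duplicates(paths, algorithm="sha256"):
    """Return sorted groups of paths whose size and content hash both match."""
    by_size = defaultdict(list)
    for path in paths:
        by_size[os.path.getsize(path)].append(path)

    by_digest = defaultdict(list)
    for size, group in by_size.items():
        if len(group) < 2:
            continue  # a unique size cannot be a duplicate, so skip hashing it
        for path in group:
            hasher = hashlib.new(algorithm)
            with open(path, "rb") as handle:
                for chunk in iter(lambda: handle.read(1 << 20), b""):
                    hasher.update(chunk)
            by_digest[(size, hasher.hexdigest())].append(path)
    return [sorted(group) for group in by_digest.values() if len(group) > 1]
```

Files with a unique size are never read at all, which is where most of the savings come from on typical file systems.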

Status:  🟢 Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
 

📓 Refer to technical docs

Overview:  Migrates Elasticsearch field data from one index to another.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Sends email reports based on Elasticsearch search queries/criteria.

Status:  🟢 Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

Overview:  Designed to migrate tags from one index to the next. Generally, these tags are applied post index through manual tag application or plugin tag application.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

Data Mover Plugins

Diskover addresses what data belongs where and when. This image is a simplified diagram of data mover plugins, how they work, and how they can be automated or user-initiated.

Overview:  With the Ngenea Data Orchestrator File Action, authorized users can quickly and securely transport data, directly from the Diskover UI, to and from globally distributed cloud, object storage, traditional NAS files, and tape resources, automatically moving data into the ‘right cost’ resource according to value and usage as your work teams and business needs demand.

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  The Qumulo Data Mover File Action allows authorized users to quickly and securely transport data, directly from the Diskover UI, from Qumulo to Qumulo, Qumulo to AWS, and AWS to Qumulo.

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Using the Rclone open-source technology (https://rclone.org/), the Diskover Rclone Data Mover Plugin provides the framework for reliable data movement based on pre-configured source and destination profiles. The plugin provides two mechanisms within Diskover to trigger data movement: 1) on-demand user-initiated file action directly from the Diskover interface, and 2) scheduled automated workflow based on file attributes meeting predetermined criteria.

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Regardless of distance and latency, the high-speed data transfer Vcinity Plugin provides the framework for reliable and fast data movement based on pre-configured source and destination profiles. 

The plugin can move data between NFS, SMB, and S3 storage from any vendor, for example Dell, NetApp, or HPE.

The Vcinity High-Speed Data Transfer Plugin provides two mechanisms within Diskover to trigger data movement: 1) on-demand user-initiated file action directly from the Diskover interface, and 2) scheduled automated workflow based on file attributes meeting predetermined criteria.

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   


Data Integrity Plugins

Data costs a lot to create and even more to recreate. These plugins give you peace of mind by easily validating your files’ integrity with perfect accuracy, as well as automatically renaming files with illegal characters and long names.

Overview:  Designed for precise data movement monitoring, the plugin compares the xxhash, md5, sha1, and sha256 hash values of the original file and of the resulting file once it reaches its transfer destination, catching any file corruption introduced in the process. The plugin alerts on files whose checksum at the source location does not match the checksum at the destination location; these suspect files then require a retransfer. A manifest is generated upon completion to provide assurance that all files arrived uncorrupted.
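
The source-versus-destination comparison can be sketched as below. This is a simplified illustration using Python's standard hashlib; the function names and the shape of the manifest entries are assumptions for this example, not the plugin's actual output format.

```python
import hashlib


def _digest(path, algorithm):
    """Hash one file in chunks and return the hex digest."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def verify_transfer(pairs, algorithm="sha256"):
    """Compare (source, destination) pairs; return (manifest, mismatched sources)."""
    manifest, mismatches = [], []
    for source, destination in pairs:
        source_hash = _digest(source, algorithm)
        destination_hash = _digest(destination, algorithm)
        manifest.append({
            "source": source,
            "destination": destination,
            "algorithm": algorithm,
            "source_hash": source_hash,
            "destination_hash": destination_hash,
            "match": source_hash == destination_hash,
        })
        if source_hash != destination_hash:
            mismatches.append(source)  # candidate for retransfer
    return manifest, mismatches
```

The manifest records every pair, including the ones that matched, so it can serve as evidence that the whole transfer was checked.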


Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Analyzes the index of all directories and file names for illegal characters, and long filenames or file paths to proactively find potential files with names that can break applications. Offending filenames are tagged with the corresponding non-conformance and the list of illegal filenames can then be sent via email reports. The plugin can be configured to remediate these issues with automatic renaming or character replacement.
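
A check of this kind can be sketched as follows. The set of illegal characters, the length limits, and the tag names are all assumptions chosen for illustration; in the plugin these would come from its configuration.

```python
# Illustrative settings only; real limits depend on the target applications.
ILLEGAL_CHARS = set('<>:"|?*\\')
MAX_NAME_LENGTH = 255
MAX_PATH_LENGTH = 4096


def check_name(path):
    """Return the non-conformance tags that apply to one POSIX-style path."""
    tags = []
    name = path.rsplit("/", 1)[-1]
    if any(char in ILLEGAL_CHARS for char in name):
        tags.append("illegal-chars")
    if len(name) > MAX_NAME_LENGTH:
        tags.append("long-name")
    if len(path) > MAX_PATH_LENGTH:
        tags.append("long-path")
    return tags


def sanitize_name(name, replacement="_"):
    """Replace every illegal character in a file name with the replacement."""
    return "".join(replacement if char in ILLEGAL_CHARS else char for char in name)
```

Keeping detection (check_name) separate from remediation (sanitize_name) mirrors the two modes described above: report first, rename only when configured to do so.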

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

Overview:  Designed to provide a list of file differences between two indices (or points in time). The differential list can be used to feed synchronization tools or identify deltas where two repositories should be identical. Outputs a CSV file containing the differences between the two indices.
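
The comparison can be pictured with a small sketch that diffs two snapshots and writes the result as CSV. Each snapshot is modeled here as a dictionary mapping a path to a (size, mtime) tuple; that representation, the function names, and the CSV columns are assumptions for this example rather than the plugin's actual format.

```python
import csv
import io


def diff_indices(old, new):
    """Return sorted (path, change) rows describing how new differs from old."""
    rows = []
    for path in sorted(old.keys() | new.keys()):
        if path not in new:
            rows.append((path, "removed"))
        elif path not in old:
            rows.append((path, "added"))
        elif old[path] != new[path]:
            rows.append((path, "changed"))
    return rows


def to_csv(rows):
    """Serialize the diff rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["path", "change"])
    writer.writerows(rows)
    return buffer.getvalue()
```

A synchronization tool could consume the "added" and "changed" rows as its copy list and the "removed" rows as its delete list.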

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

📓 Refer to end-user docs

User-Initiated File Action Plugins

Authorized users, following role-based access controls, can initiate File Actions directly from the Diskover interface, providing a versatile range of use cases. Whether it’s taking targeted actions on data or seamlessly integrating with third-party platforms, Diskover’s open-source architecture empowers users to develop custom File Action Plugins, enabling the automation of unique in-house workflows with ease and flexibility.

Overview:  The CineViewer Player is designed to securely view high resolution media from a remote browser, without giving users access to the source files. Using the File Action feature from the Diskover user interface, CineViewer allows for seamless validation of media assets, therefore increasing productivity, while safeguarding your production network.

Status:  🟢  Released

This plugin can be purchased from CineSys.io and works with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to technical docs

📓 Refer to end-user docs

Overview:  Due to its open-source architecture, Diskover allows users to easily develop their own File Action plugins to automate their custom in-house workflows. These plugins can be launched in one click using Diskover’s File Action feature.

Status:  Python File Actions  🟠  Testing Prototype

Status:  PHP File Actions  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs for Python File Actions...coming soon

📓 Refer to technical docs for PHP File Actions

Overview:  In post-production workflows, producers or client service representatives often receive files that are urgently required for a project, for example via email. However, due to security concerns around the production network, these individuals typically don't have read/write access to the production storage, so they have no mechanism to upload these files without involving data management or other departments. Acquiring proper authorization on a case-by-case basis introduces delays in time-critical production workflows that require fast turnaround. The EDL (Edit Decision List) Upload Plugin lets such individuals upload files directly to controlled destinations without giving them actual access to production storage. The EDL Upload Plugin makes people more efficient, reduces complexity, and safeguards restricted storage networks.

An edit decision list or EDL is used in the post-production process of film and video editing. The list contains an ordered list of reel and timecode data representing where each video clip can be obtained in order to conform the final cut. If you have two clips in your timeline joined by one edit point, your EDL will document the clips being used, their durations, ins and outs, and even any transitions or adjustments to their audio levels that you've applied. An EDL converts each asset instance and edit point into a text-only list of executions that can be used to rebuild the project elsewhere from the same set of original footage.

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

Overview:  Designed to find/list any file sequences in a directory from a single file in a sequence.
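
One way to find the rest of a sequence from a single file is to treat the last run of digits before the extension as the frame number and match siblings that differ only in that number. The sketch below takes that approach; the naming convention and the function name are assumptions, and real sequences may follow other patterns.

```python
import re

# Prefix, then the last run of digits before the extension, then the extension.
SEQ_PATTERN = re.compile(r"^(?P<prefix>.*?)(?P<frame>\d+)(?P<suffix>\.[^.]+)$")


def find_sequence(selected, directory_listing):
    """Return the members of the selected file's sequence, ordered by frame number."""
    match = SEQ_PATTERN.match(selected)
    if not match:
        return [selected]  # no frame number, so the file stands alone
    member = re.compile(
        re.escape(match["prefix"]) + r"(\d+)" + re.escape(match["suffix"])
    )
    frames = []
    for name in directory_listing:
        found = member.fullmatch(name)
        if found:
            frames.append((int(found.group(1)), name))
    return [name for _, name in sorted(frames)]
```

Sorting by the integer frame number rather than by name keeps the order correct even when padding is inconsistent, for example frame 10 after frame 9.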

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to technical docs

📓 Refer to end-user docs

Overview:  Changes the Unix permissions of a chosen set of files/folders to a configured value.
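
In Python terms, the operation amounts to something like the sketch below, which applies one configured mode to a list of paths and reports which ones actually changed. The function name and the default mode are assumptions for this example, and the sketch assumes a POSIX system.

```python
import os
import stat


def apply_permissions(paths, mode=0o750):
    """Set each path to the configured mode; return the paths that were changed."""
    changed = []
    for path in paths:
        current = stat.S_IMODE(os.stat(path).st_mode)
        if current != mode:  # skip paths that already have the target mode
            os.chmod(path, mode)
            changed.append(path)
    return changed
```

Returning only the changed paths gives the caller a simple audit trail of what the action touched.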

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  Designed for precise data movement monitoring, the plugin compares the xxhash, md5, sha1, and sha256 hash values of the original file and of the resulting file once it reaches its transfer destination, catching any file corruption introduced in the process. The plugin alerts on files whose checksum at the source location does not match the checksum at the destination location; these suspect files then require a retransfer. A manifest is generated upon completion to provide assurance that all files arrived uncorrupted.


Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  The Grant Plugin has a dual purpose: 1) assisting research institutes in managing their grants, members, and storage costs internally, and 2) fulfilling the requirements of the new NIH DMS Policy.

The Grant Plugin collects grant metadata (grant number, group ID, etc.) and attaches it to curated datasets. In turn, staff associated with a specific grant can see and search the data belonging to that grant, without access to the source files or to other grants. That extra metadata is also available for further workflow automation if needed.

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

Overview:  This plugin allows users to scan and validate IMF packages before delivery—from a remote platform and regardless of the location of the IMF Package data—saving immense amounts of man-hours over the course of a business year. IMF stands for the Interoperable Master Format, a technical standard used in the media and entertainment industry for exchanging digital content between different platforms and systems. This service helps content creators and providers validate their content and ensure it's ready to share with their distribution partner or client by meeting IMF packaging requirements.

Status:  🟠  Testing Prototype

This plugin can be purchased from Oxagile and works with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to technical docs

📓 Refer to end-user docs

Overview:  Allows read-only access to live directories to search or drill down into freshly onboarded data before it is indexed, as well as to proactively copy paths.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

📓 Refer to technical docs

📓 Refer to end-user docs

Overview:  Allows users to view and validate PDF files without giving them access to the source files.

Status:  🟠  Testing Prototype

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
   

Overview:  This plugin integrates Diskover with GLIM, allowing end users to safely view and validate media files while safeguarding your source assets and production network. Users can run advanced searches for media assets in Diskover, and then launch GLIM in one click via File Actions.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to end-user docs

Overview:  The Diskover Telestream Vantage Plugin allows end-users to submit files for transcoding directly from the Diskover user interface.

Status:  🟢  Released

This plugin is included with the following edition(s):

Community Essential Professional Enterprise Life Science Media
         

ℹ️ Learn more

📓 Refer to end-user docs

Your first step towards taking control of ALL your data.

Get hands-on experience with the Diskover platform with minimal to no deployment effort by using the Diskover Community Edition from the AWS Marketplace. The Diskover Community Edition is free to use for an unlimited time; however, applicable AWS EC2 instance resource charges apply.

