Image representing the synergy of protons around a molecule which is a broad representation of science in general and used as Diskover logo for its life science edition.

Diskover Life Science Edition

Scientific Data Management

Curated tools for scientific data management and assistance with grant funding data requirements.

Making Smart Data Smarter

Managing life-changing research with results comprised of petabytes of data spread across different storage repositories requires a comprehensive and sustainable solution. The amount of research data created by the life science sector is staggering, therefore the need to find, organize, automate, and act on this invaluable resource is crucial. The scalability, high indexing speed, and the other core features of Diskover address these needs, and with the Life Science Edition, additional tools are available to manage the specific challenges facing this industry.

Diskover is developing new plugins for life science. Please contact us for more information.

“We are extremely grateful to be working openly with major research institutes. Our clients have given our team an improved understanding of how data management can better serve the scientific community. We are honored to offer tools and continuously develop new features that will help many other life science institutes for the betterment of human life.“

– Paul Honrud, CEO | Diskover Data

Your first step towards taking control of ALL your data.

Get hands-on experience with the Diskover platform with minimal to no deployment effort by leveraging the Diskover Community Edition within the AWS Marketplace. The Diskover Community Edition is free to use for an unlimited time, however applicable AWS EC2 instance resource charges apply.

The National Institutes of Health (NIH) New Data Management and Sharing (DMS) Policy

Do you have a data management solution in place to easily fulfill the NIH DMS requirements?

National Institute of Health (NIH) Logo

As of January 25, 2023, the National Institutes of Health (NIH) has released their new Data Management and Sharing (DMS) Policy[1] governing the submission, sharing, and preservation of scientific data in order to accelerate biomedical research discovery, in part, by enabling validation of research results, providing accessibility to high-value datasets, and promoting data reuse for future research studies.

Therefore, all research institutes need to enhance their data management to meet the DMS Policy and comply with the NIH Institute, Center, or Office (ICO)-approved plans throughout the grant process:

  • Grant Application: Prospectively plan for how scientific data will be shared and preserved, as well as submit the forecasted budget for data storage.
  • Funding/Support Period: Tracking of data and its storage cost used for a specific grant funding, with NIH compliance oversight at any time, including the annual submission of the Research Performance Progress Report (RPPR)[2] by grants’ recipients.
  • Post-Funding: Shared scientific data should be made accessible as soon as possible, and no later than the time of an associated publication[3], or the end of the award/grant period, whichever comes first. Non-compliance with the NIH ICO-approved Plans may be taken into account for future funding decisions.

[1] Complete NIH DMS Policy at https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-013.html

[2] https://grants.nih.gov/grants/rppr/index.htm

[3] The DMS Policy is designed to increase the sharing of scientific data, regardless of whether a publication is produced.

Diskover Provides Tools to Meet and Exceed the NIH DMS Requirements

The Life Science Edition of Diskover Data Management Software provides capabilities for research centers to meet the data management and sharing policies required by the NIH, while also ensuring prudent use of storage infrastructure to reduce grant infrastructure spending and shift these funds towards valuable research.

Research Data Management | Visibility Without Access

Diskover provides visibility without access to research data enabling the data technicians to associate data to appropriate grant numbers. The software harvests rich metadata from the selected storage repositories, therefore end-users have access to merely the indexes of files and not the files themselves, hence assuring the integrity of the research data. The Diskover solution provides mechanisms to associate grants’ metadata (grant number, group ID, etc.) to their respective datasets.

Data Storage Cost

Prudent data management eliminates unnecessary or wasteful storage usage, therefore reducing storage infrastructure costs to ensure funding dollars are used for actual research. Diskover has an integrated storage cost feature offering granular configuration and dedicated data cost per storage repository, in turn allowing for real-time data cost monitoring. The current challenge is to further associate those storage costs per grant which is addressed with the Diskover Grant Plugin.

Storage cost feature offering granular configuration and dedicated data cost per storage repository, in turn allowing for real-time data cost monitoring.

Indexing Architecture and Speed

Diskover’s open-source architecture is highly configurable and can be extended to answer current and future exigencies. The result is better performance for all users, as well as accelerated science research.

Diskover is naturally prepared to deal with scientific research generating enormous amounts of data as it uses Elasticsearch in its backend architecture. This open-source software is extremely powerful, fast, reliable, and can handle massive numbers of files. Diskover’s unique architecture allows for large-scale storage repositories to be scanned continuously and in parallel.

Grant Plugin

Diskover recently developed a Grant Plugin which has a dual purpose with 1) assisting research institutes in managing their grants/members/storage costs internally, and 2) fulfilling the requirements for the new NIH DMS Policy.

The Grant Plugin collects and sparses grants’ metadata (grant number, group ID, etc.) to curated datasets. In turn, staff associated with a specific grant has visibility/searchability of their limited data/grant without access to the source files or other grants. That extra metadata is also available to use for further workflow automation if needed.

Adding grant-related metadata to actual properties of files, folders, and objects provides visibility and accountability of the actual data per grant. The primary investigators now have the ability to search and analyze data associated with their research grants.

The Diskover Life Science Edition facilitates the data management requirements from the grant application funding process, to the ongoing data management requirements during the actual research phase, to the eventual publishing required by the NIH DMS Policy.

Diskover’s goal is to offer the most sustainable data management solution to help organizations increase their productivity and reduce their expenditures. For example, users can automate workflows around the life cycles of projects/data, therefore managing and reusing storage instead of buying additional storage due to obsolete data accumulation.

Diskover aims at maximizing and automating data processes to save organizations man-hours and reduce human-prone errors.

The bottom line for research institutes, is the less money spent on infrastructure the more can be allocated to research itself.

BAM Harvest Plugin

The BAM plugin allows for the curation of genome sequence file transformation and curation.

The BAM harvest plugin is designed to provide BAM and SAM metadata attributes about a file without granting the Diskover user any read/write file system access for data integrity measures.

The BAM plugin enables additional metadata for the SAM and BAM file formats to be harvested at time of indexing, therefore those extra fields are searchable, reportable for analysis, and actionable, allowing for potential upstream file management, manually or via automated scheduled tasks.

🖱️ Learn more about end-user interaction with the plugin.
🖱️ Learn more about the technical integration of the BAM plugin.

BAM Plugin Workflow Diagram

Diagram of the workflow of the BAM plugin which is part of the Diskover Life Science Edition.

BAM Plugin in the Diskover User Interface

Image representing the Diskover file search page reflecting the bam_info column resulting from extra metadata collected via the BAM plugin included in the Diskover Life Science Edition.
Bam_info fields as viewed within Diskover file search page.

BAM Attributes

Image representing the Diskover file attributes page reflecting the bam_info fields resulting from extra metadata collected via the BAM plugin included in the Diskover Life Science Edition.
Bam_info fields as viewed in the file attributes window.

Get in Touch to Schedule a Demo or a 30 Day Free Trial

Scroll to Top