SOLUTIONS for LIFE SCIENCE
Accelerate scientific discovery with advanced data tools.
Managing life-changing research with results comprised of petabytes of data spread across different storage repositories requires a comprehensive and sustainable solution.
Whether meeting NIH data requirements or streamlining collaboration, Diskover transforms complex datasets into actionable insights through specialized metadata harvesting and workflows.
USING DISKOVER TO OPERATIONALIZE LIFE SCIENCE DATA PIPELINES
USE CASE
World-renowned institute builds key infrastructure for clinical and research pathology teams.
CHALLENGE
MANAGE
AUTOMATE
BENEFITS
MAKING SMART DATA SMARTER
Tools to meet and exceed NIH requirements.
Do you have a data management solution in place to easily fulfill the NIH DMS requirements?
NIH Data Management and Sharing (DMS) Policy Requirements
As of January 25, 2023, the National Institutes of Health (NIH) has released its Data Management and Sharing (DMS) Policy governing the submission, sharing, and preservation of scientific data in order to accelerate biomedical research discovery in part, by enabling validation of research results, providing accessibility to high-value datasets, and promoting data reuse for future research studies.
Therefore, all research institutes need to enhance their data management to meet the DMS Policy and comply with the NIH Institute, Center, or Office (ICO)-approved plans throughout the grant process:
Compliance simplified.
The Diskover life science tools are designed for research centers to meet the data management and sharing policies required by the NIH, while also ensuring prudent use of storage infrastructure to reduce grant infrastructure spending and shift these funds towards valuable research.
Visibility without access.
Diskover provides visibility without access to research data, enabling the data technicians to associate data with appropriate grant numbers. The software harvests rich metadata from the selected storage repositories; therefore, end-users have access to merely the indexes of files and not the files themselves, hence assuring the integrity of the research data. The Diskover solution provides mechanisms to associate grants’ metadata (grant number, group ID, etc.) to their respective datasets.
Cost control.
Prudent data management eliminates unnecessary or wasteful storage usage, therefore reducing storage infrastructure costs to ensure funding dollars are used for actual research. Diskover has an integrated storage cost feature offering granular configuration and dedicated data cost per storage repository, in turn allowing for real-time data cost monitoring. The current challenge is to further associate those storage costs per grant which is addressed with the Diskover Grant Plugin.
Storage cost feature offering granular configuration and dedicated data cost per storage repository, in turn allowing for real-time data cost monitoring.
Scalable, reliable, fast.
Diskover’s open-source architecture is highly configurable and can be extended to answer current and future exigencies. The result is better performance for all users, as well as accelerated science research.
Diskover is naturally prepared to deal with scientific research generating enormous amounts of data as it uses Elasticsearch in its backend architecture. This open-source software is extremely powerful, fast, reliable, and can handle massive numbers of files. Diskover’s unique architecture allows for large-scale storage repositories to be scanned continuously and in parallel.
Diskover is your trusted ally at every step of your grant journey.
GET STARTED WITH
DISKOVER
Ready to manage your data everywhere from anywhere?
WORKFLOW OPTIMIZATION PLUGIN
Grant plugin.
Curated tool for scientific data management and assistance with grant funding data requirements.
Streamlines grant funding requirements.
The Grant plugin serves a dual purpose: it assists research institutes in managing grants, members, and storage costs internally while ensuring compliance with the NIH DMS Policy.
The Grant plugin efficiently collects and organizes grant metadata (e.g., grant numbers, group IDs) into curated datasets. This allows staff associated with a specific grant to access and search their allocated data without needing access to source files or unrelated grants. The metadata can also be leveraged for further workflow automation when needed.
By linking grant-specific metadata to files, folders, and objects, the Grant plugin enhances visibility and accountability for grant-related data. Principal investigators gain the ability to search, analyze, and manage datasets tied to their research grants, streamlining both compliance and operational efficiency.
More insights, minimum effort.
The Diskover Life Science Edition facilitates the data management requirements from the grant application funding process, to the ongoing data management requirements during the actual research phase, to the eventual publishing required by the NIH DMS Policy.
Save time, storage, and overall resources.
Diskover’s goal is to offer the most sustainable data management solution to help organizations increase their productivity and reduce their expenditures. For example, users can automate workflows around the life cycles of projects/data, therefore managing and reusing storage instead of buying additional storage due to obsolete data accumulation.
Diskover aims at maximizing and automating data processes to save organizations man-hours and reduce human-prone errors.
The bottom line for research institutes, is the less money spent on infrastructure the more can be allocated to research itself.
METADATA HARVEST PLUGIN
BAM plugin.
Key metadata for efficient curation of genome sequence files.
The BAM plugin allows for the curation of genome sequence file transformation and curation.
The BAM harvest plugin is designed to provide BAM and SAM metadata attributes about a file without granting the Diskover user any read/write file system access for data integrity measures.
The BAM plugin enables additional metadata for the SAM and BAM file formats to be harvested at time of indexing, therefore those extra fields are searchable, reportable for analysis, and actionable, allowing for potential upstream file management, manually or via automated scheduled tasks.