The pace of change in life sciences is staggering as new technology allows researchers to see and do things never before possible. Laboratory instruments such as multi-dimensional microscopy, functional MRI, and DNA sequencers generate terabytes of data each, adding up to petabyte-scale storage requirements. DNA sequencing has seen dramatic proliferation due to a rapid reduction in cost. Thousands of sequencers sit in labs and hospitals around the world as they begin to sequence patients’ genomes to customize care. A single DNA sequencer can easily generate multiple terabytes of data per day.

The challenge in today’s life sciences world is how to efficiently store and manage the massive amount of diverse data and access patterns with concurrent usage for local and remote users and applications. The overwhelming requirement for researchers is shared access to common storage so lab instruments, compute clusters and workstations see all the same data. They also want access to all data whether derived or primary. The answer unfortunately often leads to implementing SAN solutions at the lab level that creates silos of data and are expensive to scale.

Traditional Storage is not Suited for Today’s Life Science Workloads

Much of the data would eventually be stored on tape – that is not possible today due to the requirement for global online access and collaboration. Traditional RAID-based SAN systems are expensive and complex, and by themselves, cannot guarantee data will never be lost. This requires a backup solution that increases operational cost. All of this adds unnecessary complexity and overhead that strains IT budgets.

Life Sciences Storage Requirements

Easy to scale performance and capacity without service disruption
A global namespace that can handle a large number of objects and buckets
Easily integrates with existing applications without modification
Concurrent access by large number of users/applications
No single point of failure, high data integrity and durability
Extremely low TCO at approximately 50% less than traditional storage systems
Easily adapts to changing needs without any disruptions of service


Amplidata has addressed these and other storage challenges with Himalaya™, an object-based software defined storage platform that delivers unbreakable durability, infinite, scalability, and extreme efficiency. It is optimized for Intel-based commercial off-the-shelf hardware from a variety of vendors and provide flexible, cost effective exabyte scalability. Designed to take full advantage of the latest Intel* Xeon* Processors, performance scales linearly with each controller added to the system.

Get the Life Sciences Solution Brief