Wednesday, June 10, 2026
Hammerspace Hits the Market with Global Parallel File System

(whiteMocca/Shutterstock)

Since he left Fusion-io eight years ago, David Flynn has been working to solve the broken relationship between data and storage. That effort gave rise to Hammerspace and its Global Data Environment, which delivers a single global namespace for users and applications to access files, as well as a metadata-driven management layer that eliminates the need to copy, move, and manage data as it sits in various silos.

As CEO and co-founder of Fusion-io (acquired by SanDisk in 2014 for $1.1 billion), Flynn played a leading role in unlocking the power of solid-state storage for a new class of customers. Storing data on fast NVMe drives right next to compute helped to obliterate the storage I/O bottleneck. But it inadvertently helped to create another bottleneck: the rapid proliferation of data silos that are tightly coupled to the storage infrastructure on which they live.

“Basically, at a very foundational level, the relationship between data and the storage infrastructure is actually quite broken,” Flynn said. “Data is not real. Data is a mirage that’s presented by the storage infrastructure.”

This mirage forces enterprises with large data sets and large, dispersed teams to make tradeoffs in how they can access that data. While object storage systems like S3 have basically infinite scalability, Flynn calls object storage a “cop out” because it introduces unacceptable latencies while also pushing data management into the application. That leaves enterprises with a mish-mash of object, file, and block storage systems that don’t meet their needs and that require constant manual intervention by storage administrators to keep them fresh and relevant.

“I became keenly aware of this at Fusion-io, where introducing what has become NVMe flash and server-local flash having such high performance gave a really good reason to want to… co-locate [data] down on the servers that are using it,” said Flynn, who is the CEO and co-founder of Hammerspace. “But that just drove to the point of absurdity this challenge of the fact that data is an emergent property of the infrastructure, and to have tiny little silos in every server would kill you.”

So he set out to rethink the fundamental relationship between data and storage, and he came to the conclusion that the answer to managing a diverse and dispersed storage infrastructure is a metadata-driven management layer atop a parallel file system.

“It’s folly to think you can build a big enough, one-size-fits-all silo and stick the data in it,” Flynn told Datanami. “We have to ultimately say data needs to transcend the storage system that holds it. And the answer to doing that is to decouple metadata from data and empower the system to be viewed through the metadata and have the metadata manage the data across the infrastructure.”

This is essentially what Flynn and his team have created at Hammerspace. The company has developed a parallel file system, based on NFS version 4.2, that presents a single global namespace and enables enterprises to store data across multiple datacenters in an active-active, eventually consistent fashion. On top of that, Hammerspace offers a metadata-driven management layer designed to significantly reduce the burden on human administrators of ensuring that data is pre-positioned where users can get it (as well as handling the snapshots, versioning, and security of data).

What this means is that users can access data anywhere in the world through a single global namespace, even if the data was originally stored on the other side of the world. “That means that your data is, for the first time, omnipresent across each of these and universally accessible, with all of the conveniences of high performance, parallel file access,” Flynn said. “That’s what’s revolutionary.”

Hammerspace isn’t the first distributed file system to teach NFS or SMB new tricks. In fact, the company’s parallel file system rides atop the parallelism built into NFS 4.2. One of Hammerspace’s co-founders is CTO Trond Myklebust, who is Linus Torvalds’ handpicked maintainer and lead developer for the Linux kernel NFS client. So anybody adopting that new file system, such as via RHEL 8, can get that benefit.

It’s the coupling of the parallel file system with the metadata-driven management layer that really sets Hammerspace apart.

“This is really two pieces. One is a parallel file system,” Flynn explained. “If you’re familiar with the high performance computing [HPC] industry and what they do there to be able to reach massive scale, these are proprietary and exotic file systems that don’t have the enterprise reliability and feature set. What we have done is we’ve taken NFS and the seeds of parallel NFS and enhanced that so that NFS itself can be a true parallel file system and still support the enterprise data services, snapshots, clones, the RAS [reliability, availability, serviceability], the enterprise capability.”

It’s the second component, the metadata-driven management layer, that really enables enterprises to manage the fragmented nature of their data storage infrastructure, Flynn says.

“Because now your data is presented through its metadata, your data is automatically managed and orchestrated through the metadata, and therefore it decouples the data completely from the common storage infrastructure,” Flynn said. “People are very used to managing the data manually by setting up, by copying it themselves, or by setting up links and tools to copy it. We’re moving to a declarative model where they manage data through its metadata.”

The company’s file system has been generally available for several months and is in production at some of the largest telecom and gaming companies in the world, says Molly Presley, who recently joined the company as its SVP of marketing.

“Essentially what we’re really focusing on… are the industries that have the need for large scale datasets,” Presley said, “whether that’s a few hundred terabytes to petabytes or hundreds of petabytes; a global workforce, whether they’re full-time employees or their data users are contractors; and where their infrastructure is global.”

Instead of moving large amounts of data from an on-prem cluster into the cloud so that users can access and process it, Hammerspace enables companies to store data one time in its global namespace. Users on AWS, Google Cloud, or Microsoft Azure can then access it by spinning up a Hammerspace environment on their private cloud instances and simply mounting the global file system for their applications.

“Think about all of those times where you’re moving data. Maybe it’s from the GPFS environment up into the cloud,” Presley said. “We don’t have to make multiple copies of the data. The users are interacting with it at the global namespace level, so you don’t have that added cost of two, three, or four copies of these large datasets, which is unmanageable and expensive. That’s part of what the data orchestration capabilities provide, is not just moving data around, but doing it in an efficient way, being able to warm the cache, in essence, for the application or user who’s using the data. Make sure you don’t have multiple copies because the user and the application are interacting with our namespace, not through discrete storage systems.”

Hammerspace CEO and co-founder David Flynn

The metadata-driven management layer allows Hammerspace users to customize how their data is distributed. For example, users can set up a rule that says certain pieces of data should be pre-positioned in the cloud, and the file system will automatically move that data to the cloud in the background. The rule can be defined in multiple ways, such as by when the data was last accessed or by whether it has been flagged.
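To make the idea of a placement rule concrete, here is a minimal Python sketch of how such rules could be evaluated. It is an illustration only and not Hammerspace's rule language or API: the FileMeta and PlacementRule types, the "preposition" tag, the seven-day window, and the "cloud" and "on-prem" location names are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta
from typing import Callable, List, Set, Tuple


@dataclass
class FileMeta:
    """Hypothetical per-file metadata record (not Hammerspace's schema)."""
    path: str
    last_accessed: datetime
    tags: Set[str] = field(default_factory=set)
    locations: Set[str] = field(default_factory=lambda: {"on-prem"})


@dataclass
class PlacementRule:
    """Declares where matching files should live, not how to move them."""
    name: str
    matches: Callable[[FileMeta], bool]
    target: str


def plan_moves(files: List[FileMeta], rules: List[PlacementRule]) -> List[Tuple[str, str]]:
    """Return (path, target) pairs for files that match a rule but have no copy at its target."""
    moves: List[Tuple[str, str]] = []
    for f in files:
        for rule in rules:
            move = (f.path, rule.target)
            if rule.matches(f) and rule.target not in f.locations and move not in moves:
                moves.append(move)
    return moves


NOW = datetime(2021, 1, 1)  # fixed clock so the example is repeatable

# Two assumed rules: one keyed on last access time, one on an explicit flag.
rules = [
    PlacementRule("recently-used", lambda f: NOW - f.last_accessed <= timedelta(days=7), "cloud"),
    PlacementRule("flagged", lambda f: "preposition" in f.tags, "cloud"),
]

files = [
    FileMeta("/proj/a.dat", NOW - timedelta(days=1)),
    FileMeta("/proj/b.dat", NOW - timedelta(days=90), tags={"preposition"}),
    FileMeta("/proj/c.dat", NOW - timedelta(days=90)),
]

planned = plan_moves(files, rules)
```

In this sketch the administrator only states the desired outcome in the rules; a background process would then carry out whatever plan_moves returns, which is the declarative pattern the article describes.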

“We have both reactionary and on-demand. You can simply mount the file system and start accessing stuff and it will create instances of those files in the cloud. Think of it like caching it in the cloud,” Flynn said. “If you take the time to describe what subset of data is going to be needed in advance, the system can pre-orchestrate it to already be resident.”
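The difference between the two modes Flynn describes can be shown with a toy cache in Python. This is a sketch under stated assumptions, not how Hammerspace is implemented: SiteCache, its origin dictionary standing in for remote storage, and the read_misses counter are all hypothetical names introduced for illustration.

```python
class SiteCache:
    """Hypothetical site (such as a cloud region) in front of a remote origin store."""

    def __init__(self, origin):
        self.origin = origin    # dict of path -> bytes, stands in for remote storage
        self.local = {}         # file instances already resident at this site
        self.read_misses = 0    # reads that had to wait on the origin

    def read(self, path):
        """On-demand mode: the first access creates a local instance."""
        if path not in self.local:
            self.read_misses += 1
            self.local[path] = self.origin[path]
        return self.local[path]

    def prefetch(self, paths):
        """Pre-orchestrated mode: make a declared subset resident ahead of time."""
        for path in paths:
            if path not in self.local:
                self.local[path] = self.origin[path]


origin = {"/a": b"A", "/b": b"B"}

# On-demand: the first read pays the remote fetch, the second is served locally.
reactive = SiteCache(origin)
reactive.read("/a")
reactive.read("/a")

# Pre-orchestrated: the subset was described in advance, so no read waits.
warmed = SiteCache(origin)
warmed.prefetch(["/a", "/b"])
warmed.read("/a")
```

Either way the application calls the same read method; the only thing that changes is whether the data movement happens before or during the first access.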

In either case, the file system is moving data behind the scenes, eliminating the need for administrators to do that work. That is a paradigm shift in how data is managed, Flynn says.

“Instead of managing in the imperative through manual methods, we’re introducing for the first time the language in metadata for it to be self-descriptive, for the data to be in charge, to say what it needs in terms of its own very existence,” he said. “It’s like picking up the data by its own bootstraps and now you’re managing data across the variety of infrastructure. This is a very big paradigm shift and I would liken it to managing servers in the virtual versus managing servers by racking and stacking appliances.”

Flynn says customers can get a huge increase in data accessibility and performance by layering Hammerspace atop their existing fleets of NAS devices, including NetApp, EMC Isilon, and Qumulo appliances, “or anything that speaks NFS v3.” “That’s earth-shattering,” he said. “That’s never been done before that you can use your pool of NAS systems or even servers (our product can take servers with local flash and disk) and turn those into storage nodes, and then you can scale out across them.”

In addition to delivering HPC-like file system capabilities on Linux operating systems that support NFS 4.2, Hammerspace also offers legacy support for NFS version 3 (as mentioned above). The file system also supports Windows’ native file-sharing protocol, SMB, which accounts for about 60% of its customers, Flynn said. It also supports S3 and block storage (i.e. SAN). About the only thing it doesn’t support is iSCSI and FC SCSI. “That’s rearview mirror. That’s looking in the past,” Flynn said.

Today, Hammerspace announced the availability of its Global Data Environment, as well as several executive hires. This is the first time the company has marketed the file system as a single global namespace as opposed to a set of point solutions, Presley said.

In addition to hiring Presley, a veteran of the enterprise and HPC storage space, Hammerspace announced the hiring of Jim Choumas to be VP of channel sales and Chris Bowen to be SVP of global sales. “We’re really pleased to have these very skilled senior executives joining the team,” Flynn said. “It shows I think the effectiveness of the team because these are folks that really know the industry well.”

Related Items:

The Future of Computing is Distributed

Roadmap to Distributed Data Stewardship

Blurred Storage Lines: Clouds That Seem Like On-Prem
