Pilosa looks to past to shape data's future

Fin-Tech

News Desk

January 31st, 2023

Why trust us

Advertiser Disclosure

Society’s ability to generate data at scale is well ahead of its ability to interpret it at the same rate, but a new company is changing that by looking at an old tool in a new way.

Launched in early May at the Oscon conference in Austin, Texas, Pilosa is a new generation of technology that decouples the index from data storage and optimizes it for massive scale by deploying a bitmap index on high-cardinality data, CEO Higinio Maycotte said.

Society is generating new data at a rate much faster than Moore’s Law. That volume makes it harder to interpret data at scale, as data retrieval technology has fallen beside that which generates it.

“It’s going to solve a major problem for everyone who works with data sets of one terabyte or more,” Mr. Maycotte said. “Pilosa makes a terabyte of data respond to queries as if it were 10 megabytes.”

Mr. Maycotte explained databases consist of two parts – storage and the index on which queries are run. Instead or residing within data stores, Pilosa sits on top of them. As a bitmap index, it uses less space so it can run in memory and not on disk.

Pilosa plans on using the power of the crowd to quickly offer solutions. It has introduced nine patents into the open source community where the technology develops three to five times as fast as anywhere else, Mr. Maycotte said.

“As open-source software, Pilosa is available today on GitHub. Our first version includes production-tested features, including single and multi-node index support, replication, Algorithm Plugins, Data Importer, and Basic Cluster Management. Customers can collaborate with us or pay a fee to add Pilosa to their stack and to access premium modules that we’ve built to further optimize performance.

“Our focus right now is on building a community around this software. Open-source projects live and die by the people who work on and around them.”

[caption id="attachment_52772" align="alignright" width="273"]

Pilosa processing speeds[/caption]

Scientific research involving proteins is a data intensive area, Mr. Maycotte said. Most existing models can only accommodate a small fraction of the actual proteins in the human body but scientists can employ Pilosa’s models and capture the entire data set.

“Genomic analyses can be completed in orders of magnitude faster,” Mr. Maycotte said.

Mr. Maycotte used the simple example of determining my favorite shirt colors. Pilosa turns that into a question by assigning a 1 or 0 to my like or dislike of every color. The binary system is highly compressible. Should someone want to know the favorite shirt colors of thousands or millions of people, this can easily be determined, along with a host of related factors.

“We want to sit on top of some of the largest data sets in the world,” Mr. Maycotte said. “Our pilot projects include moonshot initiatives like cancer research. Joining and asking questions of multiple whole genomes simultaneously is exactly the kind of work Pilosa was built to help accomplish.”

Contributors

News Desk

The latest news, comment and analysis from our crypto news desk.

Crypto investors seek new opportunity with Bitbot: Here's Why

Benson Toti

46% of Global Crypto Millionaires Attribute Their Wealth to Bitcoin

Elizabeth Kerr

Transak and Chiliz Join Forces to Simplify Crypto Access for Sports Fans

News Desk