
Dremio Announces Support for Apache Arrow Flight High-Performance Data Transfer

New open source data connectivity interface co-developed by Dremio engineers enables data transfer that is more than 10 times faster than traditional data transfer APIs

SANTA CLARA, Calif.--(BUSINESS WIRE)--Dremio, the innovation leader in data lake transformation, today announced support for Apache Arrow Flight, an open source data connectivity technology co-developed by Dremio that radically improves data transfer rates. As a result, client applications can now communicate with Dremio’s data lake service more than 10 times faster than with decades-old technologies such as Open Database Connectivity (ODBC) and Java Database Connectivity (JDBC).

The implementation comes as data scientists, engineers and architects scale their applications and need to exchange data across process boundaries quickly and efficiently, without making copies. As companies continue to implement machine learning models and become more data-centric and data-driven, they require high-speed access to data to be successful. Apache Arrow, an open source project co-created by Dremio engineers in 2016, is now downloaded over 20 million times per month. Arrow Flight enables Arrow-powered technologies, such as Dremio and Python data science libraries, to exchange data at network speeds without any serialization/deserialization overhead.

“Even as data volumes have increased by orders of magnitude, companies have had to continue to rely upon archaic 25-year-old technologies like ODBC and JDBC for data transfer. While these technologies are fine for applications that require small datasets, they are a bottleneck for modern applications, such as machine learning, where millions of records are retrieved over the wire. Today we are announcing the availability of Arrow Flight in Dremio, which will open the door for new applications of data and set the performance standard for high-speed data transfer in the modern enterprise,” said Tomer Shiran, founder and chief product officer at Dremio.

In addition to superior performance, Arrow Flight offers many other benefits. Arrow Flight is cross-platform and has multi-language support including Python, Java and C++, with others to come. As an example, data scientists can retrieve data directly from a Flight-enabled database like Dremio into a Python dataframe without having to extract the data into local files on the client.
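The dataframe workflow described above might look roughly like the following sketch, which uses the `pyarrow.flight` client from the Apache Arrow Python library. The helper names `flight_location` and `query_to_dataframe` are hypothetical, and the default port (32010), the unencrypted `grpc+tcp` scheme and basic username/password authentication are assumptions about a particular deployment rather than details taken from this announcement.

```python
def flight_location(host: str, port: int = 32010, use_tls: bool = False) -> str:
    """Build a gRPC URI for a Flight endpoint (the default port is an assumption)."""
    scheme = "grpc+tls" if use_tls else "grpc+tcp"
    return f"{scheme}://{host}:{port}"


def query_to_dataframe(sql: str, host: str, username: str, password: str,
                       port: int = 32010, use_tls: bool = False):
    """Run a SQL query over Arrow Flight and return the result as a pandas DataFrame."""
    # Imported lazily so flight_location() works even without pyarrow installed.
    from pyarrow import flight

    client = flight.FlightClient(flight_location(host, port, use_tls))

    # Basic-auth handshake; returns a (header name, bearer token) pair
    # that is attached to every subsequent call.
    token_pair = client.authenticate_basic_token(username, password)
    options = flight.FlightCallOptions(headers=[token_pair])

    # Ask the server how to fetch the result set for this query.
    descriptor = flight.FlightDescriptor.for_command(sql)
    info = client.get_flight_info(descriptor, options)

    # Stream Arrow record batches from the first endpoint only (a simplification)
    # and convert them to pandas; no intermediate files are written on the client.
    reader = client.do_get(info.endpoints[0].ticket, options)
    return reader.read_pandas()
```

A call such as `query_to_dataframe("SELECT 1", "localhost", "user", "secret")` would need a reachable Flight server and valid credentials; those values are placeholders.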

The ability to avoid data extracts, combined with Arrow Flight’s wire-level encryption and authentication capabilities, enables companies to overcome data governance and security challenges. Since data is being consumed directly from the centralized IT-controlled database or data lake service, data teams can control and monitor access to the data and delete records when necessary to comply with GDPR and CCPA requirements, such as “the right to be forgotten.”

Arrow Flight is now available as part of the Apache Arrow 3.0 release. To learn more, register for Dremio’s webinar, “Eliminate Data Transfer Bottlenecks with Apache Arrow Flight,” on Thursday, Feb. 25 at 10 a.m. PT / 1 p.m. ET.

About Dremio

Dremio reimagines the cloud data lake to deliver faster time to analytics by eliminating the need to copy and move data to proprietary data warehouses, or create cubes, aggregation tables and BI extracts. A self-service semantic layer provides flexibility and control for data architects, and self-service for data consumers. Founded in 2015, Dremio is headquartered in Santa Clara, CA. Investors include Cisco Investments, Insight Partners, Lightspeed Venture Partners, Norwest Venture Partners, Redpoint Ventures and Sapphire Ventures. For more information, visit www.dremio.com. Connect with Dremio on GitHub, LinkedIn, Twitter and Facebook.

Contacts

Gillian Roberts
gillian.roberts@aircoverpr.com
(818) 395-2948

