REDWOOD CITY, Calif.--(BUSINESS WIRE)--Paxata, the pioneer in self-service data preparation, today announced that the United States Patent and Trademark Office has granted the Company patent US20170109402A1 for Automated Join Detection across two diverse data sets. This feature is a key cornerstone in empowering non-technical business consumers to visually and interactively prepare, enrich and turn raw data into ready information at scale. Automatic Join Detection is foundational to the Paxata Adaptive Information Platform in that it uses Natural Language Processing (NLP), search techniques and other algorithms to identify candidate columns that could form the basis for joining two disparate data sets.
“The entire objective of self-service data preparation is for intelligent algorithms to do the heavy lifting. What can take weeks and months to do is now possible in a matter of seconds. That is the power of automated join detection,” said Prakash Nanduri, Co-Founder and CEO. “Finding, preparing and governing data that resides across the enterprise and multiple clouds is the key challenge of the day. Our vision has always been about enabling business users and intelligent systems to consume clean data easily and rapidly. This patent underscores Paxata’s position as the most powerful enterprise information platform in the industry. I congratulate the team, led by my co-founders Dave Brewster and Nenshad Bardoliwalla, who brought this to life.”
Unlike traditional approaches that rely on similarly named and formatted columns for join detection, Paxata will look for possible similarities in one or multiple columns from each of the data sets and propose a list of joins to the user with a confidence factor based on the completeness of the join operation. For example, one data set might have separate columns for First and Last names, while the second data set might have a column for Last name, First Name.
Beyond the intelligent algorithms used, Paxata’s patent also covers the ability to perform the processing for the Automatic Join Detection across distributed compute nodes, enabling it to process massive volumes of data, and not just on small sample data sets. Paxata also delivers this in a configurable manner via its intuitive user interface which allows the user to select various algorithmic options quickly and easily.
At Paxata, we transform data into information on-demand to empower every person, process, and system in the organization to be more intelligent. Our Adaptive Information Platform provides business leaders and analysts with an enterprise-grade, self-service data preparation application to deliver better customer experiences, improve operational efficiencies, and comply with regulatory requirements. Built on Apache SparkTM and optimized to run in hybrid, multi-cloud environments, Paxata leverages algorithmic intelligence and distributed computing to deliver an immersive business consumer experience that accelerates and automates the data-to-insight pipeline. Paxata is headquartered in Redwood City, California with offices in New York, Ohio, Texas, and Singapore. Visit www.paxata.com or engage with us on Twitter, LinkedIn, Facebook, or YouTube.