Algebraix Query Accelerator (AQUA) for Apache Spark
Powered by Data Algebra™
The Algebraix Query Accelerator (AQUA) is a software component for Spark SQL that lets you automatically provision computations of Spark SQL’s directed acyclic graph. AQUA leverages patented inter-query reuse technology to improve performance and reduce cloud infrastructure costs.
By applying AQUA to the Spark framework, developers and data scientists can use less expensive resources, run fewer nodes, and shorten processing times, reducing total cost of ownership.
Whereas most SQL optimization techniques focus on establishing adjacent data stores, AQUA optimizes the actual query execution plans produced by Spark’s Catalyst optimizer. Our software uses Data Algebra to identify and cache equivalent computations, then removes the redundant work from Spark SQL jobs while preserving the correct end results.
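To make the idea of reusing equivalent computations across queries concrete, here is a minimal toy sketch in Python. It is not AQUA's implementation and does not use Spark or Data Algebra; the names ReuseCache, canonical, TABLES, and PREDICATES are hypothetical, and the only equivalence it recognizes is the reordering of filter predicates. Two queries that differ only in predicate order map to the same cache key, so the second query does no filtering work.

```python
from typing import Callable, Dict, FrozenSet, List, Tuple

Row = Dict[str, int]
# Toy "plan": a table name plus an order-insensitive set of predicate names.
Plan = Tuple[str, FrozenSet[str]]

# Hypothetical sample data and predicates, for illustration only.
TABLES: Dict[str, List[Row]] = {
    "t": [{"x": 1, "y": 5}, {"x": 2, "y": 5}, {"x": 3, "y": 20}],
}
PREDICATES: Dict[str, Callable[[Row], bool]] = {
    "x > 1": lambda r: r["x"] > 1,
    "y < 10": lambda r: r["y"] < 10,
}


def canonical(table: str, predicates: List[str]) -> Plan:
    """Map equivalent filter queries (same predicates, any order) to one key."""
    return (table, frozenset(predicates))


class ReuseCache:
    """Caches results by canonical plan so equivalent queries share work."""

    def __init__(self) -> None:
        self.results: Dict[Plan, List[Row]] = {}
        self.hits = 0
        self.misses = 0

    def run(self, table: str, predicates: List[str]) -> List[Row]:
        key = canonical(table, predicates)
        if key in self.results:
            # An equivalent query already ran: reuse its result.
            self.hits += 1
            return self.results[key]
        # First time this plan is seen: compute and remember the result.
        self.misses += 1
        rows = [
            r for r in TABLES[table]
            if all(PREDICATES[p](r) for p in key[1])
        ]
        self.results[key] = rows
        return rows


cache = ReuseCache()
first = cache.run("t", ["x > 1", "y < 10"])   # computed
second = cache.run("t", ["y < 10", "x > 1"])  # equivalent plan, served from cache
```

In this sketch the first call is a cache miss and the second is a hit, even though the two queries are written differently. A real system would need a far richer notion of plan equivalence, plus cache invalidation when the underlying data changes.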
AQUA is a simple-to-install software package that works in conjunction with Amazon Web Services, Amazon Elastic MapReduce (EMR), and Amazon S3 storage. Using our product requires no changes to your current Spark scripts or queries.
The initial version of AQUA runs alongside Apache Spark to improve SQL performance and user concurrency in that environment; however, AQUA is being developed for other database and big data cloud environments, including Microsoft Azure and IBM Bluemix.