Apache Tajo Software Description
Apache Tajo is a robust data relational and distributed data warehouse system for Apache Hadoop that delivers interactive analytic capabilities on structured and semi-structured data residing. The data is available in HDFS as well as NoSQL datastores leveraging the power of SQL with rich extensions from JVM languages.
It is a complete data warehouse system, including data ingestion, reports, and OLAP reports. Tajo provides SQL and JVM extensions that can be used to manipulate data regardless of its source or format. The mission of Apache Tajo is to provide seamless SQL generation capabilities to all data sources, including relational, NoSQL, and semi-structured data sources, providing ease of use and interoperability without code generation.
Tajo can help bring paging and filtering to semi-structured data sources by means of native SQL queries, providing a mechanism to query large semi-structured data sets in a distributed environment. Tajo provides the ability to visually navigate through all the tables and to perform ad hoc reports on semi-structured data sets. It is possible to build a wide range of interactive web user interfaces using Java Swing components and XML templates. All in all, Apache Tajo is a great solution that you can consider among its alternatives.