HIVE + Impala Storage for Entities

The platform can connect to HIVE or Impala and, from an existing table, generate the ontology that represents it. Remember that the ontology is an abstraction of the persistence mechanism: it makes you independent of how the data is accessed, gives you a SQL interface and JSON results whatever the underlying database is, and adds security and auditing.

To do this, go to the Create Ontology option in the Control Panel and from there select:

Next, select the JDBC connection that applies (either HIVE or Impala).

If no connection exists yet and you have the access data, you can create it from

Once the connection is selected, the list of existing tables appears:

Selecting a table maps its columns and generates the JSON-Schema that represents the ontology:
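As a rough illustration of what this mapping step involves, the following Python sketch turns a list of column names and HIVE/Impala types into a JSON-Schema document. The type table, the function name `table_to_json_schema`, the schema draft, and the sample `flights` table are all assumptions made for this example; the schema the platform actually generates may be structured differently.

```python
import json

# Assumed mapping from HIVE/Impala column types to JSON-Schema types.
# The platform's real mapping may differ.
HIVE_TO_JSON_TYPE = {
    "tinyint": "integer",
    "smallint": "integer",
    "int": "integer",
    "bigint": "integer",
    "float": "number",
    "double": "number",
    "decimal": "number",
    "boolean": "boolean",
    "string": "string",
    "varchar": "string",
    "char": "string",
    "timestamp": "string",
    "date": "string",
}


def table_to_json_schema(table_name, columns):
    """Build a JSON-Schema dict from (column_name, hive_type) pairs."""
    properties = {}
    for name, hive_type in columns:
        # Drop parameters such as "(10,2)" in "decimal(10,2)".
        base_type = hive_type.lower().split("(")[0].strip()
        # Unknown or complex types fall back to "string" in this sketch.
        properties[name] = {"type": HIVE_TO_JSON_TYPE.get(base_type, "string")}
    return {
        "$schema": "http://json-schema.org/draft-04/schema#",
        "title": table_name,
        "type": "object",
        "properties": properties,
    }


if __name__ == "__main__":
    # Hypothetical table and columns, for illustration only.
    schema = table_to_json_schema(
        "flights", [("id", "bigint"), ("carrier", "string"), ("delay", "double")]
    )
    print(json.dumps(schema, indent=2))
```

Falling back to "string" for unrecognised types keeps the sketch short; a real mapping would need explicit handling for complex types such as arrays, maps and structs.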

After this, complete the definition of the ontology (at least the meta-inf and description fields). You can also change the name of the ontology. Finally, click New.

The ontology that represents the HIVE/Impala table now exists and is available to the rest of the Platform tools, for example to compose dashboards or to publish APIs.

The example shows a query in the Query Tool. The platform uses Impala as its query engine, which allows online access to tables with millions of records:

 

In Big Data scenarios it is typical to access HIVE/Impala tables directly from Spark.

The Platform Notebook allows you to do so:
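For readers without access to the original screenshot, here is a minimal PySpark sketch of this kind of access. It assumes a Python Spark interpreter with HIVE support is available in the notebook; the helper names `build_select` and `preview_table`, the application name, and the database and table names are hypothetical. Many notebook interpreters already expose a ready-made `spark` session, in which case the session-building step is unnecessary.

```python
def build_select(database, table, limit=10):
    """Return a SELECT statement for database.table with a row limit."""
    return f"SELECT * FROM {database}.{table} LIMIT {int(limit)}"


def preview_table(database, table, limit=10):
    """Read a HIVE table through Spark SQL and print its first rows."""
    # Imported here so build_select works even without PySpark installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .appName("hive-table-preview")  # hypothetical application name
        .enableHiveSupport()  # lets Spark SQL resolve tables in the HIVE metastore
        .getOrCreate()
    )
    df = spark.sql(build_select(database, table, limit))
    df.show()
    return df
```

Calling `preview_table("default", "flights")` would run the generated query against a hypothetical `default.flights` table and display the first ten rows.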



Because the ontology is inferred from the table, the result of any operation performed on the table can be seen directly through the ontology (NOTE: in Impala it is necessary to run the INVALIDATE METADATA command).
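The note about Impala can be sketched as a small helper that builds the INVALIDATE METADATA statement and runs it on any DB-API style cursor connected to Impala. The function names and the way the cursor is obtained are assumptions for illustration; the statement itself follows Impala's `INVALIDATE METADATA [[db_name.]table_name]` syntax.

```python
def invalidate_metadata_statement(table=None, database=None):
    """Build an Impala INVALIDATE METADATA statement."""
    if table is None:
        # Without a table name Impala invalidates metadata for all tables,
        # which can be expensive on large catalogs.
        return "INVALIDATE METADATA"
    qualified = f"{database}.{table}" if database else table
    return f"INVALIDATE METADATA {qualified}"


def refresh_impala_metadata(cursor, table=None, database=None):
    """Run the statement on a DB-API style cursor and return it."""
    statement = invalidate_metadata_statement(table, database)
    cursor.execute(statement)
    return statement
```

Naming the table limits the invalidation to that table, which is usually preferable after writing to a single table from Spark or HIVE.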