HIVE + Impala Storage para Entidades
The platform allows to connect to HIVE or Impala and from a table to generate the ontology that represents it (remember that the ontology represents the abstraction of the persistence mechanism so that it makes me independent of the access giving me a SQL interface whatever the DB is, returning me JSON whatever the database is and giving me security and auditing).
For that from the Control Panel I will go to the option Create Ontology and from there I will select:
After this I will select the JDBC connection that applies to me (either HIVE or Impala).
If there is no connection and I have the access data I can create it fromÂ
Once the connection is selected, a list of existing Tables will appear:
And selecting one will map the columns of the table and generate the JSON-Schema that represents the ontology:
After this, we will complete the definition of the ontology (at least the meta-inf and description part), we can even modify the name of the ontology and finally click on New.
After this we already have the ontology that represents my HIVE/Impala table. This ontology is now available for the rest of the Platform tools, either to compose my dashboards, to publish my APIS,...
In the example we see a query in the Query Tool. The platform uses Impala as query engine, which allows online access to tables with millions of records:
Â
In Big Data scenarios it is typical to access HIVE/Impala tables directly from Spark.
The Platform Notebook allows you to do so:
Then, since a table is inferred as an ontology, you can see the data directly from all the operations performed on the table (NOTE: in Impala it is necessary to launch the invalidate metadata command).