University of Bahrain
Scientific Journals

A new system for massive RDF data management using Big Data query languages Pig, Hive, and Spark

Show simple item record

dc.contributor.author Banane, Mouad
dc.contributor.author Belangour, Abdessamad
dc.date.accessioned 2020-02-29T23:59:32Z
dc.date.available 2020-02-29T23:59:32Z
dc.date.issued 2020-03-01
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/3781
dc.description.abstract The era of big data has emerged. The volume of generated data has never been greater. Massive quantities of data are stored on a huge number of servers that are inter-connected and share their storage space. Computation methods have been developed to perform computation operations directly on these machines, previously used mainly for storage. Tools such as Hive, Pig, and Spark provide the means for data query and analysis but are not suitable for Semantic Data. For this kind of data, a specialized tool called SPARQL is dedicated to query semantic data represented by the Resource Description Framework or RDF. The aim of our work is to transform a given SPARQL query into a Hive program, a Pig program or a Spark script according to the user's choice. To achieve this goal, we propose a Model Driven Approach which consists of creating a metamodel for each of these tools, to define a mapping between SPARQL metamodel on one hand and each of the previous Big Data query languages (Pig, Hive, and Spark). The transformation is then performed using Atlas Transformation Language or ATL. We conducted after that an experiment on three datasets containing a large volume of distributed RDF data on a powerful server cluster to validate our approach. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.rights Attribution-NonCommercial-NoDerivatives 4.0 International *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/4.0/ *
dc.subject Semantic Web,RDF, SPARQL, Big Data, Hive, Pig, Spark en_US
dc.title A new system for massive RDF data management using Big Data query languages Pig, Hive, and Spark en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/090211
dc.volume 9 en_US
dc.issue 2 en_US
dc.pagestart 259 en_US
dc.pageend 270 en_US
dc.contributor.authorcountry Morocco en_US
dc.contributor.authoraffiliation University Hassan II en_US
dc.source.title International Journal of Computing and Digital Systems en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

The following license files are associated with this item:

This item appears in the following Issue(s)

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 International Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International

All Journals


Advanced Search

Browse

Administrator Account