TY - JOUR
T1 - HyGraph
T2 - a subgraph isomorphism algorithm for efficiently querying big graph databases
AU - Asiler, Merve
AU - Yazıcı, Adnan
AU - George, Roy
N1 - Funding Information:
This research is funded in part by NSF, and NGC under Grant Numbers FAIN-1901150, and 2017–2007 respectively. Any opinions, findings, and conclusions expressed here are those of the author(s) and do not reflect the views of the sponsor(s).
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/12
Y1 - 2022/12
N2 - The big graph database provides strong modeling capabilities and efficient querying for complex applications. Subgraph isomorphism which finds exact matches of a query graph in the database efficiently, is a challenging problem. Current subgraph isomorphism approaches mostly are based on the pruning strategy proposed by Ullmann. These techniques have two significant drawbacks- first, they are unable to efficiently handle complex queries, and second, their implementations need the large indexes that require large memory resources. In this paper, we describe a new subgraph isomorphism approach, the HyGraph algorithm, that is efficient both in querying and with memory requirements for index creation. We compare the HyGraph algorithm with two popular existing approaches, GraphQL and Cypher using complexity measures and experimentally using three big graph data sets—(1) a country-level population database, (2) a simulated bank database, and (3) a publicly available World Cup big graph database. It is shown that the HyGraph solution performs significantly better (or equally) than competing algorithms for the query operations on these big databases, making it an excellent candidate for subgraph isomorphism queries in real scenarios.
AB - The big graph database provides strong modeling capabilities and efficient querying for complex applications. Subgraph isomorphism which finds exact matches of a query graph in the database efficiently, is a challenging problem. Current subgraph isomorphism approaches mostly are based on the pruning strategy proposed by Ullmann. These techniques have two significant drawbacks- first, they are unable to efficiently handle complex queries, and second, their implementations need the large indexes that require large memory resources. In this paper, we describe a new subgraph isomorphism approach, the HyGraph algorithm, that is efficient both in querying and with memory requirements for index creation. We compare the HyGraph algorithm with two popular existing approaches, GraphQL and Cypher using complexity measures and experimentally using three big graph data sets—(1) a country-level population database, (2) a simulated bank database, and (3) a publicly available World Cup big graph database. It is shown that the HyGraph solution performs significantly better (or equally) than competing algorithms for the query operations on these big databases, making it an excellent candidate for subgraph isomorphism queries in real scenarios.
KW - Exact matching algorithm
KW - Graph database
KW - Neo4j databases
KW - Query graph search
KW - Subgraph isomorphism problem
UR - http://www.scopus.com/inward/record.url?scp=85128640223&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85128640223&partnerID=8YFLogxK
U2 - 10.1186/s40537-022-00589-0
DO - 10.1186/s40537-022-00589-0
M3 - Article
AN - SCOPUS:85128640223
SN - 2196-1115
VL - 9
JO - Journal of Big Data
JF - Journal of Big Data
IS - 1
M1 - 40
ER -