JCypher: A Java Domain-Specific Language (DSL) for Cypher

Community Contributor
6 min read

An Introduction to the JCypher Java DSL
I first came into contact with the Neo4j graph database and the Cypher graph query language about two years ago. I was immediately fascinated and wanted to start working with it, so I decided to write a native Java DSL – a domain specific language for Cypher.
Adding Abstraction Layers via Domain Mapping
At this point in our model, we have access to graph databases at different levels of abstraction: with a query DSL at a lower level, and with a generic graph model at a somewhat higher level. The next question that naturally arises is: Are there more levels of abstraction that make sense? And the answer is yes. The next level of abstraction, which we call domain mapping, is to take an arbitrarily complex graph of Java objects, POJOs — Plain Old Java Objects, or what we’ll call domain objects – and store it to the graph database for later retrieval. You don’t have to modify your objects or their classes, add annotations or write a single line of mapping code or configuration because JCypher provides default mapping. At that same level of abstraction, you need to have the ability to query your graph of domain objects, which JCypher does by providing another Java DSL called Domain Query Language. With Domain Query Language, you can formulate queries based on concepts of your domain model rather than concepts of the underlying graph model. JCypher also provides some non-functional features such as transactions and concurrency support, which provides the ability for multiple clients to access a single database as well as multi-threaded access from within one client. Consider the following data model:

create
method, which returns a list of persons
that serve as root objects of the domain object graph.
Next, to access a graph database, we instantiate a DBaccess
object by means of a factory. We specify the database connection as remote and add some properties, most importantly the URL where we have the Neo4j server. Because we’re working with domain models and domains, we also need domain access — which we again create by means of a factory.
A business domain must have a unique name within a database, and in our case the name is People Domain
. Now with the domain access at hand, we simply call store given the domain objects and the “enter a graph of domain objects” is stored into the database.
Executing Domain Queries using JCypher
To formulate and execute a domain query, we create a query object from the domain access:
person
.
Next, we specify some constraints on that domain object match using WHERE
clauses. We specify the person(s) as having the last name “Smith” and the first name of “John.” Consecutive WHERE
clauses are ended by default. If you want to include “or,” you need to insert an or
clause as well as brackets to arbitrarily nest those expressions.
With this query, we are trying to determine who else lives at John Smith’s addresses, which we do by using a graph traversal clause. We start traversing from John Smith forward FORTH
via points of contact — which brings us to John Smith’s addresses — and then continue backward via points of contact to objects of type person
. In that way we have to find another DomainObjectMatch
for persons
which live at John Smith’s addresses. It’s that simple!
Next, we need to execute the query, and we can retrieve the actual result for every domain object match that we have specified in the query. In our case, we retrieve a list of persons
who live at John Smith’s addresses. And then we enter a query of our domain model.
In the next query, we want to know of those who live at John Smith’s addresses, how many of them live in Europe? In this next query, we only need to MATCH
for the object with a type “area” and the name “Europe”:

DomainObjectMatch
.
Now we have collected all areas of all Smith’s addresses. To complete the query, we use a SELECT
clause to select all of the Smiths for which the collected areas contain Europe. Next we execute the query and retrieve the actual result, which is a list of persons with the last name Smith who have addresses in Europe.
You can do a lot more interesting and powerful things with domain queries, which are all described in the project’s documentation and the distinct samples project. With all of those domain queries, you don’t have to worry about optimising database structure or mappings for query performance.
Also, because the graph of domain objects is backed by a graph database, navigating the database is really cheap. In contrast with relational databases, you need to optimize database structures almost on a per query basis, especially when it comes to navigating highly connected data.