Neo4j 2.3.0

Release Date: 21 October 2015

Neo4j 2.3 is a significant release, that provides read and write performance for massive operational throughput — making it our fastest and most scalable graph database yet.

Highlights

High concurrent performance at scale with fully off-heap cache
Improved Cypher performance with smarter query planner
Schema Enhancements: property existence constraints
String-enhanced graph search
Spring Data Neo4j 4.0
Official Docker support
PowerShell support
Mac Installer and Launcher
Easy 3rd party monitoring with Neo4j Metrics
View custom content in Neo4j Browser, for instance:
```
:play https://guides.neo4j.com/guide-library.html
```

Detailed changes

Cypher Queries

Added new warning mechanism. The first supported warning is generated for queries that contain a potentially unintended cartesian product
Fixed a MergeInto bug so that the direction of the relationship is considered when creating relationships
Generates a warning when a plan using LOAD CSV uses MATCH or MERGE on a large label without an index
Generates a warning when a plan using LOAD CSV also contains an EagerPipe
Introduce STARTS WITH, ENDS WITH and CONTAINS
ShortestPath no longer has a max limit of 15 steps
Generates a warning when using shortestPath without supplying an upper limit
CREATE from lists is now deprecated
Will now allow literals to be placed on the left-hand side of an inequality predicate also when using an index
MergeInto will now properly take direction into account when creating relationships
Allow arbitrary expressions for URL in LOAD CSV
Support for using compressed data with LOAD CSV
Added reverse function for reversing strings
Unfulfillable index hints (e.g. on non-existing indices) will now produce a warning or throw an error as configured
Now supports dynamic property lookup (f.e. using n[‘property’])
Usage of bare node patterns is now deprecated
Range queries can now be solved using index seeks
Prefer size() over length() for collections and pattern predicates
Accept lower case ‘e’ in Cypher exponent double literals
Added support for join hints: USING JOIN ON n
Added DETACH DELETE to Cypher
Fixes a bug where input positions could become wrong on Windows platforms
The compiler is now preloaded on db startup, minimizing delay of initial query
Triadic selection is now supported
Function exists() can now make use of index scans
LIMIT now properly influence planning costs
Generates a warning when dynamic property lookup makes index plans impossible
New constraint type: property existence for nodes and relationships
Inequality predicates can now be chained on the form: 0 < n.prop < 10
ShortestPath should now return paths that fulfil any predicates enforced upon it
The str() function is now deprecated
Cypher queries will no longer be logged in messages.log
Removed support for Cypher version 2.0 and 2.1, as announced
Produces error or warning if a query cannot be solved using the specified planner
Specifying a default Cypher planner in the settings now only applies to the default language version / parser
Cypher may now use full index scans to solve queries containing has(n.prop) if appropriate
Correctness fixes to Dijkstra regarding floating point precision, which can now be supplied by user
Optimizations to Dijkstra where it didn’t terminate as soon as it could have
Introducing bidirectional Dijkstra algorithm which often out-performs monodirectional Dijkstra
Shortest path algorithm supports result Path filtering

Kernel

Fix a bug that caused an incorrect page cache size to be reported when the dump_configuration setting was enabled.
Status codes of the ‘transient’ classification now cause transactions to be rolled back. This resolves several outstanding issues where deadlocks, network errors and other similar problems required clients to manually roll back transactions. Only ClientError and ClientNotification classified status codes leave open transactions running.
Adds checkpoints to the log, decoupled from log rotation. This allows for asynchronous flushing of the page cache down to storage, resulting in short-to-no pauses for write transactions that would previously occur around log rotation.
Adds PropertyContainer#getAllProperties(), returning a Map<String,Object> of all properties as a more convenient and more performant way of getting all properties of a Node or Relationship.
Object cache (i.e. node/relationship/property data cached as java objects in heap) has been removed. This reduces heap usage as well as code complexity, while for most use cases maintaining or surpassing performance levels of 2.2.x.

Cluster

No longer resolves hostnames into IPs. This should help in data center installations where a hostname might resolve into different IPs depending on which center the server is running in
Force-elect a demoted node if it is the only option. This happens predominantly in clusters where some members are slave-only.
No longer logs heartbeat messages since logging may incur I/O waits, which in extension may have an instance appear unavailable with the consequence that master switch happens.
Fixes an issue where an error during store copy from master might result in instance being forever unable to join cluster until manually restarted.
Give ongoing transactions a bit of time (10 seconds default) to complete before making a required role switch

Operations

Neo4j console does not fork a subprocess on Linux, enabling easier process management
Resolved Github Issue #5531: Added aliases for Host and Port. This stops the java command accidentally have multiple arguments on the command line e.g. -port 1337 -port 1337
Resolved Github Issue #5533: Added text in the example to explicitly state that the
Windows Service had already been installed
Resolved Github Issue #5536: Added link documentation about the NEO4J_HOME variable in the online manual
Adds a Windows PowerShell for managing a Neo4j

Configuration

Change the default dbms.pagecache.memory setting from 75% to 50% of free system memory after the JVM heap.
Client and transient errors cause transactions to be rolled back immediately, instead of merely be marked as failed with the purpose to fail the upcoming commit.
Configure http max header sizes with added configuration options:
- org.neo4j.server.webserver.max.request.header
- org.neo4j.server.webserver.max.response.header
Configuration for rotating the query log:
- dbms.querylog.rotation.threshold: Specifies at which file size the query log will auto-rotate
- dbms.querylog.max_archives: Specifies maximum number of history files to keep
With the removal of the object cache, the following settings have been removed:
- cache_type
- node_cache_size
- relationship_cache_size
- node_cache_array_fraction
- relationship_cache_array_fraction
- cache.memory_ratio
- high_performance_cache_min_log_interval

Tools

Neo4j Metrics

Introducing Neo4j Metrics, available for Neo4j Enterprise Edition
Captures basic metrics: JVM/memory, network activity and transactions
Feed available to consumers like Graphite, Ganglia or CSV

Neo4j Browser

New drawer for :config settings
:play https://guides.neo4j.com/guide-library.html can load guide content from URL
Add config to control outgoing connections (Enterprise Edition)
Display Cypher warnings in editor gutter
In EXPLAIN and PROFILE result frames, show Cypher warnings view if warnings exists
In database drawer: display only labels and rel types in use
Deprecate webadmin
Code view of Cypher result frames, displaying information about the HTTP request and response

Import-tool

Prints help information when no arguments were given
Root-cause-like stack traces are printed for unexpected errors, regardless of the value of --stacktrace that user supplied. That option still controls whether or not stack traces should be printed for common errors, which are usually accompanied by a more descriptive explanation.
Includes metadata about the store and stored transactions in diagnostics logging during startup.
Added ability to skip duplicate nodes, i.e. multiple nodes with the same ID in the same id space. Maximum total number of bad entries (nodes and relationships) during an import is still controlled using --bad-tolerance=<max bad entries>. Skipping of bad relationships/nodes can be enabled/disabled individually with --skip-duplicate-nodes and --skip-bad-relationships.

Consistency Checker

Additional check for node label id arrays that are not sorted
Redesign of record access pattern and added parallelization for faster and more scalable operation. The previous behaviour can still be had by adding --experimental=false

Known Issues

For map keys, not-null predicate does not work. Instead, please use the negation of is-null. Typically this occurs when using LOAD CSV. For example, the following predicate in the WHERE clause will not work correctly:

LOAD CSV WITH HEADERS FROM "https://some.valid.url/data.csv" 
WITH line
WHERE line.Company IS NOT NULL // Don't do this.
MERGE (c:Company {name: line.Company})

Instead, use this form of the predicate in the WHERE clause:

LOAD CSV WITH HEADERS FROM "https://some.valid.url/data.csv" 
WITH line
WHERE NOT line.Company IS NULL // Do this instead.
MERGE (c:Company {name: line.Company})

Upgrade Instructions

All Neo4j users are recommended to upgrade to this release. Upgrading to Neo4j 2.3.0 requires a migration to the on-disk store and can not be reversed. Please ensure you have a valid backup before proceeding. Production users should note that there will be configuration changes required, and that applications will need to be updated to support authentication of database requests. Please see our Upgrading to Neo4j 2.3 FAQ for further details. (Note: Neo4j 1.9 users may also upgrade directly to this release, and are recommended to do so carefully. We strongly encourage verifying the syntax and validating all responses from your Cypher scripts, REST calls, and Java code before upgrading any production system. For information about upgrading from Neo4j 1.9, please see our Upgrading to Neo4j 2 FAQ.)

Recent Database Releases

See All Database Releases →

Release Notes: Neo4j 2.3.0