Graph Databases

Graph databases treat relationships as first-class citizens: nodes and edges are stored natively, traversals run in constant time per hop, and multi-hop queries beat recursive SQL by orders of magnitude. Reach for them when the relationships are the data — social graphs, fraud rings, knowledge graphs, dependency maps, identity and access trees. For non-graph NoSQL engines (document, key-value, wide-column, time-series, search), see NoSQL Databases.


1. Production Graph Engines

Neo4j & ArangoDB

Two production graph engines compared: Neo4j’s Cypher property-graph model versus ArangoDB’s multi-model approach — with traversal patterns and query semantics.

Amazon Neptune

AWS managed graph DB — Gremlin and openCypher (property-graph) plus SPARQL (RDF). Includes Neptune ML for GNN training and Neptune Analytics for whole-graph algorithms.

JanusGraph

Distributed graph on Cassandra / ScyllaDB / HBase, indexed via Elasticsearch. Speaks Apache TinkerPop’s Gremlin. The standard choice when Neo4j scale isn’t enough.

Dgraph

Distributed native graph DB in Go — predicate sharding, GraphQL-first API, ACID transactions across the whole graph. Built on BadgerDB.

Related on this site

NoSQL Databases

The non-graph NoSQL landscape — in-memory caches, document stores, wide-column tables, key-value engines, time-series, and search. Pick by access pattern.

GraphQL

GraphQL the query language — typed schema, single endpoint, client-driven field selection. An API spec, often paired with graph or relational backends. Don’t confuse it with graph databases.

Vector Databases

Specialized similarity-search databases for embeddings. Often paired with knowledge graphs in modern RAG systems — embeddings retrieve, graphs traverse and reason.

BadgerDB

Pure-Go LSM key-value store — the storage layer underneath Dgraph. WiscKey value separation, ACID transactions.


2. Graph Databases — Advanced Technical Details and Usages

Graph databases overview diagram

2.1 Definition and Core Concept

A graph database is a database system designed to store, manage, and query data as a graph structure. It emphasizes relationships between entities as first-class citizens, rather than treating relationships as secondary constructs (such as foreign keys).

Graph databases are optimized for relationship traversal, pattern matching, and connected-data queries.

2.2 Graph Data Models

Property Graph Model

Common usage: social networks, fraud detection, recommendations

RDF (Resource Description Framework) Model

Common usage: knowledge graphs, semantic web, linked data

Labeled Property Graph (LPG)

2.3 Query Languages

Cypher

Gremlin

SPARQL

Native / Proprietary Languages

2.4 Storage and Architecture

Native Graph Storage

Distributed Graph Storage

In-Memory Graph Engines

2.5 Indexing Strategies

Indexes accelerate entry points, while traversals dominate query execution time.

2.6 Query Execution and Optimization

Traversal-Based Execution

Cost-Based Optimization

Path Explosion Control

2.7 Transactions and Consistency

Most production graph databases support full transactional guarantees.

2.8 Graph Analytics

Centrality Algorithms

Community Detection

Similarity and Path Algorithms

2.9 Common Use Cases

Recommendation Systems

Fraud Detection

Knowledge Graphs

Network and IT Operations

Identity and Access Management

2.10 Integration with Modern Data Platforms

2.11 Strengths and Limitations

Strengths

Limitations

2.12 When to Use a Graph Database

Graph databases are most effective when relationships are the data, not just attributes.


Widely Adopted (Enterprise & Production)

Open-Source / Infrastructure-Centric

In-Memory / Real-Time Graph Engines

RDF / Semantic Graph Databases

Cloud-Native Graph Offerings

These platforms represent the most commonly used graph databases across enterprise systems, open-source ecosystems, real-time applications, semantic technologies, and cloud-native architectures.


Twelve hand-picked talks and tutorials covering each engine on this page, plus the foundational graph-DB concepts behind them. Click any thumbnail to play in place. Roughly grouped: foundations + Neo4j (row 1), ArangoDB + Amazon Neptune intros (row 2), Neptune deep dive + JanusGraph + Dgraph (row 3).

What is a graph database? (in 10 minutes)

Neo4j — Ten-minute primer on nodes, edges, and why traversal beats recursive joins. The fastest way into section 2.1.

Neo4j in 100 Seconds

Fireship — Whirlwind 100-second tour of Neo4j and the property-graph model. A pre-flight check before the longer tutorials below.

Neo4j Course for Beginners

freeCodeCamp.org — Long-form Neo4j course covering install, the browser, Cypher patterns, and graph modeling. Pairs with sections 2–4.

Cypher, the Graph Query Language

Neo4j — Official walkthrough of Cypher's ASCII-art pattern syntax. Maps directly to section 2.3.1 (Cypher).

ArangoDB Tutorial

LearnCode.academy — Beginner-friendly tour of ArangoDB's multi-model approach (graph + document + key-value) and AQL. Sets up the ArangoDB card above.

ArangoDB Multi-Model Data Modeling

Arango (official) — Why one engine for graph, document, and key-value beats stitching three together. Background for section 2.2.3 (LPG) and the Neo4j vs. ArangoDB comparison page.

Introduction to Amazon Neptune

Amazon Web Services — Official intro to Neptune — Gremlin, openCypher, and SPARQL on one managed service. Pairs with the Amazon Neptune card.

Amazon Neptune deep-dive workshop

AWS Developers — Virtual workshop combining graph-DB fundamentals with a Neptune deep dive and live demo. Bridges sections 2 and the Neptune deep dive page.

AWS re:Invent 2020: Deep dive on Amazon Neptune

AWS Events — re:Invent 2020 deep dive on Neptune internals: storage, replication, query optimization. Reference for Neptune ML and Neptune Analytics.

Introduction to JanusGraph Database

The Linux Foundation — Jason Plurad (IBM) on JanusGraph's pluggable storage (Cassandra / HBase / ScyllaDB) and TinkerPop Gremlin. Maps to the JanusGraph card.

Dgraph in 100 Seconds

Fireship — 100-second intro to Dgraph — predicate sharding, GraphQL-first, written in Go. Pairs with the Dgraph card.

Dgraph - A distributed graph database written in Go

Singapore Gophers — GopherCon SG 2017 talk on Dgraph's architecture, distributed transactions, and the BadgerDB storage layer. The detailed companion to the Fireship clip.


↑ Back to Top