Kuzu to FalkorDB Migration

A streamlined 2-step process to migrate data from Kuzu graph database into FalkorDB using automated schema discovery and CSV export.

Overview

This migration tool bridges the gap between Kuzu and FalkorDB by:

Automatically discovering your Kuzu database schema
Exporting all nodes and relationships to properly formatted CSV files
Loading these CSV files into FalkorDB using the FalkorDB Rust loader

The process ensures complete data migration including nodes, relationships, properties, and metadata.

Features

Automatic Schema Discovery: Dynamically discovers all node types and relationship types in your Kuzu database
FalkorDB Compatibility: Generates CSV files in the exact format expected by FalkorDB
Intelligent Label Mapping: Maps Kuzu relationship names to standardized FalkorDB edge types
Complex Property Handling: Properly handles lists, nested values, and various data types
Comprehensive Export: Exports both nodes and relationships with full metadata
Schema Documentation: Optional JSON schema file generation for documentation purposes

Prerequisites

Python 3.6+
kuzu Python package
FalkorDB instance (local, Docker, or Cloud)
FalkorDB Rust Loader

Installation

Install the required dependencies:

pip3 install kuzu

Download the migration script:

git clone https://github.com/FalkorDB/Kuzu-to-FalkorDB.git
cd Kuzu-to-FalkorDB

Step 1: Exporting from Kuzu

Basic Usage

Export all data from a Kuzu database:

python3 kuzu_to_falkordb_export.py --db path/to/your/database

Advanced Usage

# Export with schema documentation
python3 kuzu_to_falkordb_export.py --db network_it_smart_db --schema schema.json

# Export to custom directory
python3 kuzu_to_falkordb_export.py --db network_it_smart_db --output my_csv_export

# Full example with all options
python3 kuzu_to_falkordb_export.py --db network_it_smart_db --schema schema.json --output falkordb_import

Command Line Options

Option	Description	Required	Default
`--db`, `--database`	Path to Kuzu database file/directory	Yes	-
`--schema`	Path to output schema JSON file	No	None
`--output`	Output directory for CSV files	No	`_csv_`

Output Structure

The export script generates the following files:

Node CSV Files:

Format: nodes_<NodeType>.csv
Structure: id,labels,property1,property2,...
Example: nodes_Application.csv, nodes_Machine.csv

Edge CSV Files:

Format: edges_<EdgeType>.csv
Structure: source,source_label,target,target_label,type
Example: edges_CONNECTS.csv, edges_CONTAINS.csv

Schema File (Optional):

File: schema.json
Contains: Export metadata, node types, relationship types, and file mappings

Example Export Output

🚀 Kuzu to FalkorDB CSV Exporter
==================================================
Database: network_it_smart_db
Output: _csv_
Schema: schema.json
==================================================

🔍 Discovering database schema...
  📦 Found node table: Application
  📦 Found node table: Machine
  🔗 Found relationship table: CONNECTS -> CONNECTS
  🔗 Found relationship table: INSTANCE_APP_SW -> INSTANCE
  ✓ Discovered 8 node types and 15 relationship types

📤 Exporting 8 node types...
  📦 Exporting Application...
    ✓ Exported 1,234 Application nodes to nodes_Application.csv

🔗 Exporting 15 relationship types...
  🔗 Exporting CONNECTS (from 3 Kuzu tables)...
    ✓ Exported 5,678 CONNECTS relationships to edges_CONNECTS.csv

🎉 Export completed successfully!
📁 Output files in: _csv_
📊 Exported: 8 node types, 15 relationship types

Step 2: Loading into FalkorDB

Use the high-performance FalkorDB Rust Loader to load the exported CSV files directly into FalkorDB.

Installation

git clone https://github.com/FalkorDB/FalkorDB-Loader-RS
cd FalkorDB-Loader-RS
cargo build --release

The binary will be available at target/release/falkordb-loader.

Basic Usage

After exporting your Kuzu database to CSV files, load them into FalkorDB:

./target/release/falkordb-loader my_graph

This command will:

Connect to FalkorDB (localhost:6379 by default)
Create the graph my_graph
Load all CSV files from the csv_output directory
Create indexes and constraints automatically

Advanced Usage

For more control over the loading process:

./target/release/falkordb-loader my_graph \
  --host localhost \
  --port 6379 \
  --username myuser \
  --password mypass \
  --csv-dir ./_csv_ \
  --batch-size 1000 \
  --merge-mode \
  --stats \
  --progress-interval 500

Command-Line Options

Option	Description	Default
`graph_name`	Target graph name in FalkorDB (required)	-
`--host`	FalkorDB host	localhost
`--port`	FalkorDB port	6379
`--username`	FalkorDB username (optional)	-
`--password`	FalkorDB password (optional)	-
`--csv-dir`	Directory containing CSV files	csv_output
`--batch-size`	Batch size for loading	5000
`--merge-mode`	Use MERGE instead of CREATE for upsert	false
`--stats`	Show graph statistics after loading	false
`--progress-interval`	Report progress every N records (0 to disable)	1000

Performance Features

The Rust loader provides significant advantages for loading Kuzu exports:

Async Operations: All database operations use async/await for better concurrency
Batch Processing: Processes multiple records per query (default: 5000)
Memory Efficient: Streams data from CSV files without loading everything into memory
Progress Tracking: Real-time progress updates during loading
Error Handling: Comprehensive error handling with detailed logging

Example Output

[INFO] Loading graph: my_graph
[INFO] CSV directory: _csv_
[INFO] Batch size: 5000
[INFO] Found 8 node files and 15 edge files

[INFO] Creating indexes...
[INFO] Creating constraints...

[INFO] Loading nodes...
[INFO] Loading nodes from nodes_Application.csv...
[INFO] Progress: 1000/1234 nodes loaded
[INFO] ✓ Loaded 1234 Application nodes

[INFO] Loading edges...
[INFO] Loading edges from edges_CONNECTS.csv...
[INFO] Progress: 5000/5678 edges loaded
[INFO] ✓ Loaded 5678 CONNECTS relationships

[INFO] Loading complete!

Performance Tips

Match export and loader directories: If you used --output my_csv_export during export, use --csv-dir my_csv_export when loading
Adjust batch size: For very large datasets, you might want to increase batch size: --batch-size 10000
Monitor progress: Use --progress-interval to get regular updates
Enable verbose logging: Set RUST_LOG=debug for detailed information
Use stats: Add --stats to see a summary of loaded data after completion

Data Mapping Features

Relationship Mapping

The script intelligently maps Kuzu relationship names to standardized FalkorDB edge types, ensuring consistent naming conventions.

Label Enhancement

The script enhances node labels with context for better FalkorDB compatibility:

Process nodes in different contexts: Application:Process, Service:Process, or OS:Process
Network zones: Network:Zone
Service software: Software:Service

Error Handling

The migration script includes robust error handling:

Validates database path exists
Handles missing relationship types gracefully
Continues export even if individual tables fail
Provides detailed progress and error messages

Troubleshooting

Common Issues

Database not found: Ensure the database path is correct and accessible
Permission errors: Check write permissions for the output directory
Memory issues: For very large databases, consider adjusting batch sizes or processing in chunks

Debug Mode

For additional debugging information, you can modify the script to include more verbose logging or add print statements to track the export process.

Additional Resources

Next Steps

Explore FalkorDB Cypher Language for querying your graph
Learn about FalkorDB Operations for production deployments
Check out FalkorDB Integration options