Awesome Open Source
Awesome Open Source

Debian UDD into Graph Database

The scripts in this repository pull (some) data from the Ultimate Debian Database and convert them to a graph database, in particular Neo4j.

A detailed description of the process is available either on my blog (Part 1, Part 2, Part 3), or debian-package-neo4j.md.

The scripts here are:

  • pull-udd.pl queries the UDD and downloads the two tables for packages and sources. Needs DBI::PG Perl module. This scripts needs quite some time, as the server is not fast. Please be patient.
  • generate-graph is a Perl script that reads the two csv files generated from pull-udd.pl and generates csv files ready to be imported into Neo4j
  • sort-uniq.sh ensures that duplicate lines are removed from the csvs
  • build-db assembles the proper command line for neo4j-import
  • build-all glues everything together

After the scripts has run, the files generated are in debian-neo4j and can be used as Neo4j database files.

Comments and improvements are always welcome.

Copyright

Copyright 2017-2018 Norbert Preining

License: GPL3+

Alternatives To Debian Graph
Select To Compare


Alternative Project Comparisons
Related Awesome Lists
Top Programming Languages

Get A Weekly Email With Trending Projects For These Topics
No Spam. Unsubscribe easily at any time.
Database (92,834
Script (62,350
Perl (30,700
Graph (24,416
Csv (15,149
Debian (8,766
Neo4j (3,330
Graph Database (921