Storagetapper

StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Alternatives To Storagetapper
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Awesome Postgres8,644
a month ago64cc0-1.0
A curated list of awesome PostgreSQL software, libraries, tools and resources, inspired by awesome-mysql
Wal E3,327
14a year ago33February 04, 202091bsd-3-clausePython
Continuous Archiving for Postgres
Pghoard1,213
1a month ago17May 27, 201942apache-2.0Python
PostgreSQL® backup and restore service
Gitlab Mirrors743
2 years ago19mitShell
A set of scripts adding the ability of managing remote mirrors to GitLab.
Zrepl602
12 days ago23February 15, 2022175mitGo
One-stop ZFS backup & replication solution
Replication Manager584
2 days ago24March 09, 2020100gpl-3.0Go
Signal 18 repman - Replication Manager for MySQL / MariaDB / Percona Server
Awesome Storage423
2 months ago6mit
A curated list of storage open source tools. Backups, redundancy, sharing, distribution, encryption, etc.
Zfs_autobackup391
2 months ago37July 04, 202217gpl-3.0Python
ZFS autobackup is used to periodicly backup ZFS filesystems to other locations. Easy to use and very reliable.
Storagetapper269
a year ago4November 19, 202121mitGo
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Pglogrepl172
a month ago11mitGo
PostgreSQL logical replication library for Go.
Alternatives To Storagetapper
Select To Compare


Alternative Project Comparisons
Readme

StorageTapper

Overview

Build Status Go Report Card codecov

StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service.

Storagetapper is deployed in production at Uber and used to produce snapshot and realtime changed data of thousands of MySQL tables across multiple datacenters.

It is also used as a backup service to snapshot hundreds of terrabytes of Schemaless data to HDFS and S3 with optional asymmetric encryption and compression.

It reads data from source transforms according to the specified event format and produces data to destination.

Supported event sources:

  • MySQL
  • Schemaless

Supported event destinations:

  • Kafka
  • HDFS
  • S3
  • Local file
  • MySQL (experimental)
  • Postgres (experimental)
  • Clickhouse (experimental)

Supported event formats:

  • Avro
  • JSON
  • MsgPack
  • SQL

Features:

  • Fields filtering
  • Snapshot row filtering
  • Producing realtime changes only, i.e. omitting initial snapshot
  • Snapshot and changelog events ordering
  • Periodic snapshots
  • Snapshot index hinting

Storagetapper keeps its jobs state in MySQL database and automatically distributes jobs between configured number of workers.

It is also aware of node roles and takes snapshot from the slave nodes in order to reduce load on master nodes. StorageTapper can also optionally further throttle its reads. Binlogs are streamed from master nodes for better SLAs.

Service is dynamically configurable through RESTful API or built-in UI.

Build & Install

Debian & Ubuntu

cd storagetapper
make deb && dpkg -i ../storagetapper_1.0_amd64.deb

Others

cd storagetapper
make && make install

Development

Linux

/bin/bash scripts/install_deps.sh # install all dependencies: MySQL, Kafka, HDFS, S3, ...
make test # run all tests
GO111MODULE=on TEST_PARAM="-test.run=TestLocalBasic" /bin/bash scripts/run_tests.sh ./pipe # individual test

Non Linux

make test-env
$ make test

Configuration

Storagetapper loads configuration from the following files and location in the given order:

    /etc/storagetapper/base.yaml
    /etc/storagetapper/production.yaml
    $(HOME)/base.yaml
    $(HOME)/production.yaml
    $(STORAGETAPPER_CONFIG_DIR)/base.yaml
    $(STORAGETAPPER_CONFIG_DIR)/production.yaml

Available options described in Configuration section

License

This software is licensed under the MIT License.

Popular Replication Projects
Popular Backup Projects
Popular Data Storage Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Go
Json
Mysql
Postgresql
Backup
Real Time
Kafka
Streaming
S3
Replication
Etl
Hdfs
Avro
Msgpack
Clickhouse