Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Superset | 52,168 | 2 | 8 hours ago | 3 | April 29, 2022 | 1,358 | apache-2.0 | TypeScript | ||
Apache Superset is a Data Visualization and Data Exploration Platform | ||||||||||
Modin | 8,670 | 9 | 17 | 9 hours ago | 62 | June 25, 2022 | 903 | apache-2.0 | Python | |
Modin: Scale your Pandas workflows by changing a single line of code | ||||||||||
Trino | 7,865 | 3 | 9 | 9 hours ago | 51 | December 29, 2020 | 2,310 | apache-2.0 | Java | |
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) | ||||||||||
Mage Ai | 4,711 | a day ago | 9 | June 27, 2022 | 74 | apache-2.0 | Python | |||
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data. | ||||||||||
Sql Translator | 2,821 | 2 months ago | mit | TypeScript | ||||||
SQL Translator is a tool for converting natural language queries into SQL code using artificial intelligence. This project is 100% free and open source. | ||||||||||
Data Science Best Resources | 2,466 | 2 days ago | 5 | mit | ||||||
Carefully curated resource links for data science in one place | ||||||||||
Data Diff | 2,336 | 5 days ago | 16 | July 06, 2022 | 72 | mit | Python | |||
Compare tables within or across databases | ||||||||||
Data Science Question Answer | 2,239 | 8 months ago | 4 | mit | Jupyter Notebook | |||||
A repo for data science related questions and answers | ||||||||||
Quadratic | 1,980 | a day ago | 126 | mit | TypeScript | |||||
Quadratic | Data Science Spreadsheet with Python & SQL | ||||||||||
Awesome Business Intelligence | 1,747 | 2 months ago | 8 | mit | ||||||
Actively curated list of awesome BI tools. PRs welcome! |
Trino is a fast distributed SQL query engine for big data analytics.
See the User Manual for deployment instructions and end user documentation.
See DEVELOPMENT for information about code style, development process, and guidelines.
See CONTRIBUTING for contribution requirements.
See the project security policy for information about reporting vulnerabilities.
Trino is a standard Maven project. Simply run the following command from the project root directory:
./mvnw clean install -DskipTests
On the first build, Maven downloads all the dependencies from the internet
and caches them in the local repository (~/.m2/repository
), which can take a
while, depending on your connection speed. Subsequent builds are faster.
Trino has a comprehensive set of tests that take a considerable amount of time to run, and are thus disabled by the above command. These tests are run by the CI system when you submit a pull request. We recommend only running tests locally for the areas of code that you change.
After building Trino for the first time, you can load the project into your IDE
and run the server. We recommend using
IntelliJ IDEA. Because Trino is a standard
Maven project, you easily can import it into your IDE. In IntelliJ, choose
Open Project from the Quick Start box or choose Open
from the File menu and select the root pom.xml
file.
After opening the project in IntelliJ, double check that the Java SDK is properly configured for the project:
The simplest way to run Trino for development is to run the TpchQueryRunner
class. It will start a development version of the server that is configured with
the TPCH connector. You can then use the CLI to execute queries against this
server. Many other connectors have their own *QueryRunner
class that you can
use when working on a specific connector.
Trino comes with sample configuration that should work out-of-the-box for development. Use the following options to create a run configuration:
io.trino.server.DevelopmentServer
-ea -Dconfig=etc/config.properties -Dlog.levels-file=etc/log.properties -Djdk.attach.allowAttachSelf=true
$MODULE_DIR$
trino-server-dev
The working directory should be the trino-server-dev
subdirectory. In
IntelliJ, using $MODULE_DIR$
accomplishes this automatically.
If VM options
doesn't exist in the dialog, you need to select Modify options
and enable Add VM options
.
Start the CLI to connect to the server and run SQL queries:
client/trino-cli/target/trino-cli-*-executable.jar
Run a query to see the nodes in the cluster:
SELECT * FROM system.runtime.nodes;
Run a query against the TPCH connector:
SELECT * FROM tpch.tiny.region;