delta-io / delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
See what the GitHub community is most excited about today.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Open-source code analysis platform for C/C++/Java/Binary/Javascript/Python/Kotlin based on code property graphs. Discord https://discord.gg/vv4MH284Hc
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Protocol buffer compiler for Scala.
Apache Spark - A unified analytics engine for large-scale data processing
Berkeley's Spatial Array Generator
Spark: The Definitive Guide's Code Repository
Source code for the X Recommendation Algorithm
♞ lichess.org: the forever free, adless and open source chess server ♞
Human-AI Collaborative Data Science Using Visual Workflows
CLI tool for coding agents and developers to query the public API of any Maven JVM dependency — get symbol signatures, list packages, search by name, and inspect dependency trees. Powered by Coursier and tasty-query.