Skip to content
View HyukjinKwon's full-sized avatar
🔥
🔥

Highlights

  • Pro

Organizations

@apache @databricks @cloudpipe @conda-forge @spark-korea @data-apis @py4j

Block or report HyukjinKwon

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. apache/spark apache/spark Public

    Apache Spark - A unified analytics engine for large-scale data processing

    Scala 43.5k 29.3k

  2. apache/arrow apache/arrow Public

    Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

    C++ 16.9k 4.2k

  3. adbc-driver-spark adbc-driver-spark Public

    Apache Arrow ADBC driver for Apache Spark Connect (Go core, C-ABI shared library, Python DBAPI). Arrow-native, multi-language.

    Go 1

  4. spark-connect-ruby spark-connect-ruby Public

    A pure-Ruby client for Apache Spark Connect: a PySpark-style DataFrame API over gRPC.

    Ruby 4 1

  5. spark-connect-scala3 spark-connect-scala3 Public

    A native Apache Spark Connect client for Scala 3: SparkSession/DataFrame/Column/functions API over gRPC.

    Scala 2

  6. pyspark-client-wasm pyspark-client-wasm Public

    PySpark in JupyterLite: run the real PySpark Connect client in the browser over grpc-web

    Python 3