apachespark
Here are 8 public repositories matching this topic...
type-class based data cleansing library for Apache Spark SQL
-
Updated
Jun 23, 2019 - Scala
Connect to SQL Server using Apache Spark
-
Updated
Sep 10, 2016 - Scala
Link Prediction is about predicting the future connections in a graph. In this project, Link Prediction is about predicting whether two authors will be collaborating for their future paper or not given the graph of authors who collaborated for atleast one paper together.
-
Updated
Dec 10, 2019 - Scala
This project is a data processing application built with Apache Spark and Scala. This is designed to efficiently process, analyze and transform large datasets related to people data. It leverages Spark’s distributed computing capabilities to handle scalable data ingestion, cleaning and reporting. Shell scripts are included for hadoop deployment.
-
Updated
Jun 11, 2025 - Scala
-
Updated
Feb 17, 2023 - Scala
Run your first analysis project on Apache Zeppelin using Scala (Spark), Shell, and SQL
-
Updated
Feb 16, 2024 - Scala
Improve this page
Add a description, image, and links to the apachespark topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the apachespark topic, visit your repo's landing page and select "manage topics."