New York, New York, United States
11K followers 500+ connections

Join to view profile

About

As a seasoned AI Engineer at LinkedIn, I bring a rich blend of experience and technical…

Services

Activity

Join now to see all activity

Experience & Education

  • LinkedIn

View Jason’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Licenses & Certifications

Publications

Courses

  • Applied Regression Analysis

    PHP 1511

  • Appropriate Technology

    ENGN 0930A

  • Artificial Intelligence

    CSCI 1410

  • Computer Science: Integrated Introduction

    CSCI0170

  • Data Engineering

    DATA 1050

  • Data Fluency for All

    CSCI0100

  • Data Science

    CSCI 1951A

  • Data Science: Machine Learning

    PH125.8x

  • Data and Society

    DATA 2080

  • Decision Theory: Foundations and Applications

    PHIL1550

  • Deep Learning

    CSCI 1470

  • Deep Learning and Special Topics in Data Science

    DATA 2040

  • Directed Research

    BIOL 1950

  • Hands-on Data Science

    DATA 1030

  • Honors Calculus

    MATH0350

  • Individual Independent Study

    CSCI 1970

  • Intermed Microeconomics(Math)

    ECON1130

  • Intro to Scientific Computing

    APMA0160

  • Introduction to Cognitive Science

    CLPS0200

  • Introduction to Modelling

    APMA 0200

  • Linear Algebra

    MATH0520

  • Logic

    PHIL0540

  • Making Decisions

    CLPS 0220

  • Methods of Applied Math I + II

    APMA0340

  • Methods of Applied Mathematics I, II

    APMA0330

  • Operations Research : Probabilistic Methods

    APMA 1200

  • Operations Research: Deterministic Models

    APMA1210

  • Principles of Biostatistics and Data Analysis

    PHP1510

  • Principles of Economics

    ECON0110

  • Quantitative Methods in Psychology

    CLPS0900

  • Reality Remix-Experimental VR

    DATA 1200

  • Sports in American Society

    AMST1600D

  • Statistical Inference

    APMA1650

  • Statistical Inference II

    APMA1660

  • Statistical Learning

    DATA 2020

  • Statistical Learning and Big Data

    PHP 2650

  • Statistics is Everywhere

    PHP0100

  • The Digital World

    CSCI 0020

Projects

  • Connect Four

    Implementation of the game Connect Four. Running the script produces an interactive game of Connect Four (in terminal) where you can play against another person, or against an AI Player that has been designed to think five moves ahead. All code is in the functional programming language Ocaml.

    See project
  • NBA Hackathon

    There were two questions that were asked for the 2017 NBA Hackathon:

    The first question asked to determine the probability that a team who always has a 80% chance to win never loses back to back games during an 82 game season. I found the answer of this question to be 5.88%, the answer was calculated four different ways:
    • Dynamic programming, to get the answer deterministically
    • An approximation using a derived formula
    • An approximation using large scale simulation
    • A…

    There were two questions that were asked for the 2017 NBA Hackathon:

    The first question asked to determine the probability that a team who always has a 80% chance to win never loses back to back games during an 82 game season. I found the answer of this question to be 5.88%, the answer was calculated four different ways:
    • Dynamic programming, to get the answer deterministically
    • An approximation using a derived formula
    • An approximation using large scale simulation
    • A clever use of pascal's triangles to find the exact probability

    The second question asked to find the exact date that each team was mathematically eliminated from playoff contention using the previous season's regular season results and all playoff tie breaking procedures. Using an object oriented approach, I was able to get the correct results. Code and further explanation available on Github.

    See project
  • NBA Shots Interactive Web App

    Github Repo:
    https://github.com/jasonk33/NBA-Shots-Interactive-Web-App

    Using R, I analyzed data from 10,000 NBA shots. The goal of the project was to model and predict field goal percentage. I discovered trends to increase field goal percentage by up to 20%. Using the Caret package, I performed high-level feature and model selection to create a Naive Bayes model that predicted shot precision with 65% accuracy. I created various visualizations to effectively display how field goal…

    Github Repo:
    https://github.com/jasonk33/NBA-Shots-Interactive-Web-App

    Using R, I analyzed data from 10,000 NBA shots. The goal of the project was to model and predict field goal percentage. I discovered trends to increase field goal percentage by up to 20%. Using the Caret package, I performed high-level feature and model selection to create a Naive Bayes model that predicted shot precision with 65% accuracy. I created various visualizations to effectively display how field goal percentage varied. I also used the package Shiny to build an interactive web tool to help coaches easily manipulate variables to see how effective shots were in different shooting zones. The tool can be accessed at:
    https://jasonk33.shinyapps.io/NBA_Field_Goals/

    See project
  • MLB Data Analysis

    Using R, I performed analysis on data from all MLB games dating backing to 1871. The project entailed heavy data cleaning, data manipulation, and the creation of several visualizations and predictive models. Visualizations include bar graphs to represent the home field advantage and road attendacne affects of each team, as well as a parcoord graph to display the relationship between various team statistics and overall team run production. Naive Bayes models were created to predict whether a…

    Using R, I performed analysis on data from all MLB games dating backing to 1871. The project entailed heavy data cleaning, data manipulation, and the creation of several visualizations and predictive models. Visualizations include bar graphs to represent the home field advantage and road attendacne affects of each team, as well as a parcoord graph to display the relationship between various team statistics and overall team run production. Naive Bayes models were created to predict whether a team had won and how many runs a team had scored based on statsitcs from the game.
    Results from the project can be viewed at: http://rpubs.com/jasonk33/237691
    Data was acquired from retrosheet.org

    Other creators
    • Brandon Dale
    See project
  • Predicting Stock Price Direction Using Neural Networks

    -

    • Acquired trading data with Google Finance API
    • Utilized NumPy and Pandas to create momentum based performance metrics
    • Constructed a deep neural network using TensorFlow to predict whether a stock price would rise or fall over a given time period

    See project
  • An Application of Markov Chains in Sports Rankings

    -

    There are many different ranking methods that can be used to predict the outcome of different sporting events. The focus of this project is the use of Markov chains for ranking teams in the NBA playoffs. The goal is to use the Markov method to make more accurate predictions of the outcomes of the playoffs based off of regular season performances than the NBA’s general rankings.

    See project
  • Data Science Job Scraping

    -

    In this project, all 20,000 data science jobs from Glassdoor were scraped and used to perform in depth analysis into the job market for data science. Individual skills, software, and qualification were analyzed using frequency and associated salaries to determine the value of different attributes from the point of view of the employer. A paper was also produced and is under review for publication at the Data Science Journal.

    See project
  • OpenML Machine Learning Evaluation

    -


    This project utilized the vast amount of data on OpenML to get insights into what methods for machine learning produce the most accurate classifiers. OpenML is a platform where people use different datasets and machine learning methods for classification tasks. The results from all the user uploaded runs are stored and made available for others to view. Analysis into which sklearn and weka primitives are the most accurate across the data was conducted. Additionally, dataset features based…


    This project utilized the vast amount of data on OpenML to get insights into what methods for machine learning produce the most accurate classifiers. OpenML is a platform where people use different datasets and machine learning methods for classification tasks. The results from all the user uploaded runs are stored and made available for others to view. Analysis into which sklearn and weka primitives are the most accurate across the data was conducted. Additionally, dataset features based on metadata were created and used to cluster datasets using kmeans. Analysis of how different types of datasets respond to different machine learning methods was investigated.

    See project
  • A Julia Package for the CAOS Algorithm

    -

    For this project, an official Julia package was created for the implementation of the CAOS algorithm (Characteristic Attribute Organization System). Testing was also conducted on the entire mitochondrial gene set to validate the algorithm.

    For more information, view the official paper here:
    https://drive.google.com/file/d/1LQLdC5V-aD703zsUSA7S8b9ImGdZi_cC/view?usp=sharing

    and the poster presentation…

    For this project, an official Julia package was created for the implementation of the CAOS algorithm (Characteristic Attribute Organization System). Testing was also conducted on the entire mitochondrial gene set to validate the algorithm.

    For more information, view the official paper here:
    https://drive.google.com/file/d/1LQLdC5V-aD703zsUSA7S8b9ImGdZi_cC/view?usp=sharing

    and the poster presentation here:
    https://drive.google.com/file/d/1jdNZGQ8Ak63_RPPrqTYTyyt-lRshJfvQ/view?usp=sharing

    See project

Honors & Awards

  • Joukowsky Scholar-Athlete Award

    Brown University

    The award is presented annually to two Brown students (one male and one female) who embody the best qualities of the scholar athlete by achieving success in both the classroom and in athletic competition—a reflection of Brown’s commitment to the totality of individual accomplishment.

  • Academic All-Ivy

    Ivy League Athletics

    Winter 2019

    https://ivyleague.com/news/2019/4/11/general-academic-all-ivy-winter.aspx

  • First Team All Ivy League 2019 (Long Jump)

    Ivy League Track and Field

  • First Team All Ivy League 2018 (Long Jump)

    Ivy League Track & Field

  • Second Team All Ivy League 2018 (Triple Jump)

    Ivy League Track & Field

Languages

  • English

    Native or bilingual proficiency

  • Spanish

    Elementary proficiency

Recommendations received

More activity by Jason

View Jason’s full profile

  • See who you know in common
  • Get introduced
  • Contact Jason directly
Join to view full profile

Other similar profiles

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses