San Francisco, California, United States
5K followers 500+ connections

Join to view profile

About

As a Senior Data Scientist with extensive experience in the field of data science, I have…

Articles by Rahul

Activity

Experience & Education

  • Uber

View Rahul’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Licenses & Certifications

Volunteer Experience

  • Lions Clubs International Graphic

    Board Member

    Lions Clubs International

    - Present 8 years

    Social Services

    DALLAS POLICE DEPARTMENT 16TH ANNUAL HEALTH AND SAFETY FAIR
    ● Delegated tasks for managing the event logistics and helping the attendees

    SPORTS EXTRAVAGANZA
    ● Conducted various sports events and helped the special kids to perform in the sports events
    ● Contributed to the logistics of the event
    ● Helped in event tear down

    MINNIE'S FOOD PANTRY
    ● Organized the inventory for the 2017 Thanksgiving Give Away

  • Event Coordinator

    S'Prayas

    - 1 year 11 months

    Social Services

    S’Prayas leverages Syntel’s most important resource — our people — and their collective knowledge to help poor children enrich their lives. We volunteers contribute their time, skills and experience to work with local outreach programs to drive social change.

  • Head Of Public Relations

    Reverb- Technical Event at Symbiosis Institute of Technology,Pune

    - 7 months

    Science and Technology

    Reverb, being a Techno-Cultural fest, also staged competitions in an array of technical events in Electronics and Telecommunication, Civil, Computer-Science & IT and Mechanical engineering. Participants from various colleges were left mesmerized with the lineup of exciting and thrilling events.
    The Department of Electronics and Telecommunication conducted ‘Proelium Sapientia’ which organized events like ‘Messy Codes’ to check the coding ability of students, to ‘Robo-Rave’, a robot racing…

    Reverb, being a Techno-Cultural fest, also staged competitions in an array of technical events in Electronics and Telecommunication, Civil, Computer-Science & IT and Mechanical engineering. Participants from various colleges were left mesmerized with the lineup of exciting and thrilling events.
    The Department of Electronics and Telecommunication conducted ‘Proelium Sapientia’ which organized events like ‘Messy Codes’ to check the coding ability of students, to ‘Robo-Rave’, a robot racing event.

Courses

  • Business analytics with SAS

    -

  • Data Management

    -

  • Python with machine learning

    -

Projects

  • Relational Database model for Online movie ticket booking system- JusTicket

    - Present

    Relational Database model for Online movie ticket booking system- JusTicket

    • Created an Entity-Relationship diagram and Relational Database Schema of the database.
    • Implemented the database in Microsoft Access with MS Access front-end
    • Created data input screen forms for tables/views in Access.
    • Inserted synthetic data using the input screen forms
    • Designed report forms based on interesting queries.
    • Building a menu-driven environment for the database system using…

    Relational Database model for Online movie ticket booking system- JusTicket

    • Created an Entity-Relationship diagram and Relational Database Schema of the database.
    • Implemented the database in Microsoft Access with MS Access front-end
    • Created data input screen forms for tables/views in Access.
    • Inserted synthetic data using the input screen forms
    • Designed report forms based on interesting queries.
    • Building a menu-driven environment for the database system using Switchboard

  • Healthcare Fraud Analysis

    - Present

    Through extensive Exploratory Data Analysis & innovative logical approaches, we identified possible fraudulent claims from multiple linked Health Insurance Claim data sets. In the process, we also developed Business Rules for prevention of the same in the future, to minimize loss of revenue.
    We made use of SAS Enterprise Miner and Microsoft Excel for the same.

  • Analysis of Food Prices using Tableau

    -

    • Created the data connection, loaded the data and joined tables
    • Identified dimension and measures
    • Used different aggregation levels to get appropriate insights
    • Developed dashboard to identify countries having highest prices and the price fluctuation
    • Created a story to showcase the variation of food prices

  • Salary Prediction Model using R Programming

    -

    • Built a prediction model using C5.0 algorithm which was 87.94 % accurate and identified the most important variables
    • Cleaned the data, handled missing values and tackled the class imbalance problem using SMOTE function

  • Multiple Linear regression analysis and Principal component regression analysis on General Motors collected data on U.S. Standard Metropolitan Statistical Areas (SMSA’s)

    -

    Multiple Linear regression analysis and Principal component regression analysis on General Motors collected data on U.S. Standard Metropolitan Statistical Areas (SMSA’s) on air pollution contribution to mortality

    Multiple Linear regression analysis:

    • Performed Multiple Linear regression on original dataset keeping Mortality as Target variable
    • Analysed significant variables using hypothesis analysis where P-values are less than 0.05 or 5% and again performed regression…

    Multiple Linear regression analysis and Principal component regression analysis on General Motors collected data on U.S. Standard Metropolitan Statistical Areas (SMSA’s) on air pollution contribution to mortality

    Multiple Linear regression analysis:

    • Performed Multiple Linear regression on original dataset keeping Mortality as Target variable
    • Analysed significant variables using hypothesis analysis where P-values are less than 0.05 or 5% and again performed regression analysis keeping significant variables and investigated the regression summary and Fitness of the data model looking at a coefficient of Correlation and Determination and standard error.

    Principal component regression analysis:

    • Executed Principal Component Analysis on original dataset keeping Mortality as Target variable and evaluated 5 principal components
    • Performed Multiple Linear regression on selected 5 principal components
    • Analysed significant variables using hypothesis analysis where P-values are less than 0.05 or 5% and again performed regression analysis keeping significant variables and investigated the regression

    Conclusion :

    First regression model fails to determine whether air pollution is significantly related to mortality, because of HCPot, NocPot and SO2Pot are not significant as p-value is greater than 0.05 (or 5%), instead, it indicates the positive significance of %Non-White to Mortality.
    However, the principal component regression analysis strongly indicates air pollution is significantly related to mortality. The PC1, which has the highest Eigenvalue (4.0289) and contributes to a maximum proportion of 28.78%, is strongly correlated with the variables recording the pollution potential of three different air pollutants (HCPot, NocPot, and SO2Pot).

    Hence it determines air pollution is significantly related to mortality and we would prefer regression model obtained by considering significant principal components.

  • Descriptive analysis of housing data for king county, USA using SAS E-Miner and MS Excel - Principal Components Analysis (PCA)

    -

    This dataset contains house sales prices for king county, the USA which includes Seattle. It includes houses sold between May 2014 and May 2015. This dataset enables us to study and evaluate simple regression models. There are 21 variables in the dataset and 29000 records in the dataset.

    dataset Link : https://www.kaggle.com/harlfoxem/housesalesprediction

    • Performed statistical exploration using Excel and SAS E-Miner on raw datasets and conducted trend analysis
    • Analyzed…

    This dataset contains house sales prices for king county, the USA which includes Seattle. It includes houses sold between May 2014 and May 2015. This dataset enables us to study and evaluate simple regression models. There are 21 variables in the dataset and 29000 records in the dataset.

    dataset Link : https://www.kaggle.com/harlfoxem/housesalesprediction

    • Performed statistical exploration using Excel and SAS E-Miner on raw datasets and conducted trend analysis
    • Analyzed King County housing dataset by generating Principle component analysis and correlation matrix
    • Predicted housing prices on King County data multiple linear regression analysis

Honors & Awards

  • Dean's Excellence Scholarship

    JSOM

  • The Excellence Award - Outstanding Performance

    Mr. Jitendra Patil - Team Lead

  • The Excellence Award - Client Appreciation and Individual Accomplishment

    Mr, Vasim Inamdar - Project Lead

  • kudos award!!

    Ameriprise financial

    Kudos award for outstanding work done in the 1st year of joining and delivered gmwb project single handedly

Languages

  • English

    Full professional proficiency

  • Hindi

    Native or bilingual proficiency

  • Marathi

    Native or bilingual proficiency

Recommendations received

8 people have recommended Rahul

Join now to view

View Rahul’s full profile

  • See who you know in common
  • Get introduced
  • Contact Rahul directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Rahul Patil in United States

Add new skills with these courses