San Francisco Bay Area
4K followers 500+ connections

Join to view profile

About

Current Affiliation/ Title :
ASIC RTL Design Engineer - Shader Processor (GFX), AMD…

Activity

Join now to see all activity

Experience & Education

  • AMD

View Arjun’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Volunteer Experience

  • World Wildlife Fund Graphic

    Contributing Member

    World Wildlife Fund

    - Present 14 years 7 months

    Animal Welfare

    Contributions in 2017 -

    1. Signed a petition for the Vaquita campaign assigned to permanently ban the use of gillnets in the Northern Gulf of California and increase the enforcement of illegal fishing in the area
    2. Supported the petition to help boost The Multinational Species Conservation Funds to protect the planet's remaining populations of elephants, rhinos, tigers, great apes and marine turtles by selling more than 38 million 'Save Vanishing Species' Semipostal stamps and…

    Contributions in 2017 -

    1. Signed a petition for the Vaquita campaign assigned to permanently ban the use of gillnets in the Northern Gulf of California and increase the enforcement of illegal fishing in the area
    2. Supported the petition to help boost The Multinational Species Conservation Funds to protect the planet's remaining populations of elephants, rhinos, tigers, great apes and marine turtles by selling more than 38 million 'Save Vanishing Species' Semipostal stamps and raising $4.1 million

  • Los Angeles 2028 Graphic

    Volunteer

    Los Angeles 2028

    - Present 8 years 9 months

    Social Services

    As one of 18 million talented, creative and caring Angelenos, I volunteer to serve the local communities while assisting them in hosting the Olympic & Paralympic Games for 2028. Together, we use the power of the Games to make Los Angeles an even better place to live for all Angelenos, well before 2028. My involvement not only reflects Los Angeles's passionate Olympic spirit and commitment to improving our community, importantly, it also demonstrates our readiness to support the Games in 2028.

  • Intel Corporation Graphic

    PC Pals Volunteer

    Intel Corporation

    - 4 years 8 months

    Education

    - Correspond with local students (Sutter Middle School, Folsom, CA) via e-mail and serve as a weekly resource for advice and encouragement
    - Coach students through everything from current coursework to applying for colleges.

  • Environmental Defense Fund Graphic

    Freelance Contributor

    Environmental Defense Fund

    - 5 years 5 months

    Environment

    Contributions in 2017:
    1. Repealed federal standards regulating oil and gas climate pollution on federal and tribal lands.
    2. Unilaterally suspended safeguards that reduce methane and other pollution limits from new and updated oil and gas facilities
    3. Helped in propagating California's renewed commitment to reducing greenhouse gas emissions by extending its cap-and-trade program through 2030.

  • USC Viterbi School of Engineering Graphic

    Viterbi Graduate Student Mentor

    USC Viterbi School of Engineering

    - 1 year 2 months

    Social Services

  • National Institute of Technology Karnataka, Surathkal Graphic

    Peer Mentor for DSD group

    National Institute of Technology Karnataka, Surathkal

    - 4 months

    Science and Technology

    Involved in the mentoring of students from all engineering disciplines at NITK in the field of Digital System Design using HDL and help them realize subsequent applications of Digital Design Prototyping on FPGAs in broad areas of engineering such as control systems, autonomous and actuating systems, computer arithmetic and architecture, energy harvesting and digital signal processing

  • National Institute of Technology Karnataka, Surathkal Graphic

    Teaching Assistant

    National Institute of Technology Karnataka, Surathkal

    - 1 year

    Science and Technology

    Involved in the mentoring and assistance of Freshmen Undergraduate Students for the course, Elements of Electronics and Communication Engineering (EC110) offered by the Department of Electronics and Communication Engineering at National Institute of Technology Karnataka, Surathkal

  • Student Member

    Indian Coast Guard Coastal Clean Up Drive

    - Present 14 years 7 months

    Environment

  • HelpAge India Graphic

    Contributing Member

    HelpAge India

    - 8 years 11 months

    Social Services

  • Contributing Member

    National Association for the Blind

    - 6 years 11 months

    Social Services

Publications

Courses

  • Active Filter Design

    VL830

  • Advanced Computer Architecture

    VL722

  • Analysis of Algorithms

    CSCI570

  • CMOS VLSI Design

    EE477L (Passed)

  • Communication/ Computer Networks

    EC363

  • Computer Systems Architecture

    EE557

  • Computer Systems Organization

    EE457

  • Cryptography and Cryptanalysis

    CE823

  • DSP Systems and Architecture

    EC408

  • Diagnosis and Design of Reliable Digital Systems

    EE658

  • Digital Signal Processing

    EC362

  • Digital Systems Design - Tools and Techniques

    EE560

  • Directed Research: EE (Bogdan/ Nazarian)

    EE590

  • Embedded Systems

    EC411

  • Introduction to Nano-Fabrication Lithography

    EE508

  • Linear Integrated Circuits

    EC310

  • Low Power VLSI Design

    VL821

  • Major Project - I and Major Project - II

    EC449, EC499

  • Mathematics for Communication Engg

    CE721

  • Speech and Image Processing

    EC410

  • VLSI Design and Architecture

    EC361

  • VLSI System Design - Part I

    EE577A

  • VLSI System Design - Part II

    EE577B

Projects

  • Implementation of a Logical Simulator, Pre-Processor, Fault Simulators and Automatic Test Pattern Generators for Combinational Circuits

    -

    The team implemented two different Automatic Test Pattern Generators (based on D-algorithm and PODEM), two different fault simulators (parallel and deductive), and a pre-processor.

    Other creators
  • Implementation of a Multi-core Multi-threaded CPU with MOOESI based Cache Coherence

    -

     Phase 1: Wrote assembly codes for a parallel program to run on different threads on a core using lock and lockless mechanisms to maintain sequential consistency and coherency
     Phase 2: Implemented certain design aspects of the processor core such as thread scheduler, WB-after stage, forwarding to ID/TS and ran the parallel program on the CMP synthesized on the Spartan-6 FPGA

    Other creators
  • Dual Clock FIFO Design with Synchronous SRAM (BRAM) as an interface between different modules on an SoC

    -

     Implemented a single clocked FIFO design using N and N+1 bit pointers and used it to develop an even-odd sequence arrangement
     Dual clock FIFO design with double synchronization to reduce MTBF and eliminate clock domain crossing
     Synchronized gray-coded pointers and tapped data from appropriate points to suit SSRAM design

    Other creators
  • Out of Order Executing Tomasulo Processor with Copy Free Checkpoints and In-Order Commitment

    -

     Phase 1: Described the Branch Prediction Buffer, Issue unit, Free-Register List corresponding to the Physical Register file, Store Buffers and Store Address Buffers for the out-of-order executing Tomasulo Processor in VHDL
     Phase 2: Wrote the RTL (in VHDL) for the Dispatch unit, Re-Order Buffer and Copy-Free Checkpoint mechanism with Dirty Bit Arrays to model FRATs and flush wrong path instructions in case of branch mis-prediction
     Phase 3: Synthesized the out-of-order executing…

     Phase 1: Described the Branch Prediction Buffer, Issue unit, Free-Register List corresponding to the Physical Register file, Store Buffers and Store Address Buffers for the out-of-order executing Tomasulo Processor in VHDL
     Phase 2: Wrote the RTL (in VHDL) for the Dispatch unit, Re-Order Buffer and Copy-Free Checkpoint mechanism with Dirty Bit Arrays to model FRATs and flush wrong path instructions in case of branch mis-prediction
     Phase 3: Synthesized the out-of-order executing Tomasulo processor on the Artix-7 FPGA using XST on Xilinx ISE, real time debug using Chipscope and verified the working of the dumped processor for different instruction streams

    Other creators
  • Design of a MIPS 5-stage pipelined CPU with flow through and pipelined synchronous SRAMs (BRAM)

    -

    A 5 stage linear pipelined MIPS CPU design was designed that tackled control, data and name dependencies. Synchronous SRAMs were used to instantiate the Instruction Memory and Data Memories (BRAM instantiations by Xilinx) and therefore, the design seemed to convert the 5-stage pipeline to an inefficient 7-stage (for flow-through BRAMs) and 9-stage (for pipelined BRAMs) pipeline. However, using the concept of shadow and keep registers, the design was converted back to an efficient 5-stage…

    A 5 stage linear pipelined MIPS CPU design was designed that tackled control, data and name dependencies. Synchronous SRAMs were used to instantiate the Instruction Memory and Data Memories (BRAM instantiations by Xilinx) and therefore, the design seemed to convert the 5-stage pipeline to an inefficient 7-stage (for flow-through BRAMs) and 9-stage (for pipelined BRAMs) pipeline. However, using the concept of shadow and keep registers, the design was converted back to an efficient 5-stage pipeline that modeled the modifications made by the BRAM.

    Other creators
  • Master and Slave Interface with Router Design for an AXI Interconnect Protocol based Chip Multi-Processor

    -

     Interfaced processor tiles with memory using an AXI interconnect with deterministic routing in a mesh network

     Created a master and slave interface between router and processor/ memory to generate/ receive/ reorder packets

     Requests from different processors completed out-of-order and requests from same processor completed in-order

    Other creators
  • Performance analysis and Area-Power-Resource Estimation and Optimization on Out-of-Order Superscalar Processors using SimpleScalar, Cacti and hardware complexity estimation tools

    -

    In this project, an existing out-of-order super-scalar processor simulator, SimpleScalar, reads a machine configuration file and generates a processor model that matches the requirements specified in the configuration file. The configuration file specifies parameters such as the size and associativity of caches, number of Reservation Stations, number of functional units etc. In the SimpleScalar simulator, the functionality of Reservation Station and ROB are merged into a single entity called…

    In this project, an existing out-of-order super-scalar processor simulator, SimpleScalar, reads a machine configuration file and generates a processor model that matches the requirements specified in the configuration file. The configuration file specifies parameters such as the size and associativity of caches, number of Reservation Stations, number of functional units etc. In the SimpleScalar simulator, the functionality of Reservation Station and ROB are merged into a single entity called Register Update Unit (RUU).

    Cacti is a tool that estimates the access time and power consumption of any structure that stores data, such as the RUU and cache structure. The input to Cacti is the size of the structure, cache associativity and process technology used in building the structure. Cacti is used to compute the latency of the RUU and caches. Finally, a simple area estimation tool is used that is provided as an Excel spreadsheet.

    Two benchmark programs are used from the SPEC CPU2000 benchmark - gcc for integer benchmarks and mesa for floating point benchmarks.

  • Custom made RTL Design, Synthesis, Auto P&R, STA and Verification for a 4-Core Single-threaded Tiled "Gold" Chip Multiprocessor

    -

    The goal was to generate a layout and verify the 4-core Multi-threaded Tiled "Gold" Chip Multiprocessor (CMP) using some of the CAD tools typically used in ASIC Design for performing netlist synthesis, static timing analysis, gate-level simulation, logical equivalence checking, place & route, and back-annotated simulation - namely, Design Compiler, PrimeTime, Conformal and Velocity (Encounter). Most of the time was spent on simply obtaining a synthesizable design and optimizing the area-delay…

    The goal was to generate a layout and verify the 4-core Multi-threaded Tiled "Gold" Chip Multiprocessor (CMP) using some of the CAD tools typically used in ASIC Design for performing netlist synthesis, static timing analysis, gate-level simulation, logical equivalence checking, place & route, and back-annotated simulation - namely, Design Compiler, PrimeTime, Conformal and Velocity (Encounter). Most of the time was spent on simply obtaining a synthesizable design and optimizing the area-delay product of the synthesized design. The synthesized netlist targets the NCSU 45nm standard cell library.

    We described the Hardware for the following parts of the CMP using Verilog -

    - "Gold" Network on Chip Bidirectional Ring Router with ability to avoid packet collision using virtual channels on even-odd clock polarity and round robin priorities

    - "Gold" variable width Microprocessor with ability to counter data, control, structural and name dependencies for a custom made "Gold" ISA

    - "Gold" Network Interface Component to enable interactions between the microprocessor and the Router for thread switching applications

    - "Gold" CMP - The integration of all the above components in a structural and hierarchical manner

    Other creators
  • Performance Evaluation of Static and Dynamic Branch Predictors on a Benchmark using PinTool

    -

    Building up libraries for static taken and not taken branch predictors as well as dynamic bimodal and two level predictors using C++ and comparing their behavior using performance evaluation metrics such as prediction rates.

  • Implementation of a 5 stage linear pipelined CPU using Verilog HDL

    -

    - Implemented a 5-Stage Design with Instruction Fetch (IF), Instruction Decode (ID), Execution stage 1 (EX1), Execution stage 2 (EX2), and Write Back Stage (WB) stages to execute basic arithmetic functions
    - More custom functions were implemented showcasing the basic concepts of pipelining such as data-stationary control, forwarding and stalling
    - Data dependencies were taken care of by designing an appropriate forwarding unit (FU) and a hazard detection unit (HDU)

    Other creators
  • Design of a Pipelined General Purpose Microprocessor using Software and Hardware Components

    -

    A general purpose Multi-Cycle CPU can support simple Instructions such as Add, Subtract, Bitwise operations, Store Word, and Load Word. A simple structure of a CPU contains Decoding Logic, Register File, Execution Units, Memory, and other surrounding circuitry. The design included a 32-bit ALU with dynamic operation, a 1024-bit SRAM to load and store data from the memory and 8 32-bit registers as part of the register file.

    In this project, we try to implement a simple general purpose…

    A general purpose Multi-Cycle CPU can support simple Instructions such as Add, Subtract, Bitwise operations, Store Word, and Load Word. A simple structure of a CPU contains Decoding Logic, Register File, Execution Units, Memory, and other surrounding circuitry. The design included a 32-bit ALU with dynamic operation, a 1024-bit SRAM to load and store data from the memory and 8 32-bit registers as part of the register file.

    In this project, we try to implement a simple general purpose CPU with the knowledge that we gained in the course EE577A, which included adders, multipliers and SRAMs. Aided by scripting in Perl for the EE577A labs, we aim at optimizing the area, delay, and power.

    Note that the CPU designed in this final project may not be exactly the same as the ones we learnt in our computer architecture courses. One main component of our course EE577A is optimization for delay and power, and therefore in addition to functionality correctness, we were supposed to implement at least one of the circuit blocks using dynamic logic. Also we need to take at least one power optimization measure learnt and we used clock gating. For delay optimization, we used minimization of critical path delay using logical effort.

    Other creators
  • Malicious Activity Prediction in Public Surveillance based on Real Time Video Acquisition

    -

    Major project as part of course curriculum - The major project tries to replicate, recreate and prototype "The Machine" from the hit network TV series "Person of Interest" which initially looks at teaching a central arbiter, the classification of real time videos and prediction of occurrence of cataclysmic events based on the model we develop. For now, a prototype on prediction of criminal/ suspicious activities ranging from petty thefts to hazardous terror attacks in public surveillance…

    Major project as part of course curriculum - The major project tries to replicate, recreate and prototype "The Machine" from the hit network TV series "Person of Interest" which initially looks at teaching a central arbiter, the classification of real time videos and prediction of occurrence of cataclysmic events based on the model we develop. For now, a prototype on prediction of criminal/ suspicious activities ranging from petty thefts to hazardous terror attacks in public surveillance systems has been abstracted.

    Other creators
  • High-speed Camera with Embedded Shape Recognition

    -

    The goal of this project is to develop on an FPGA, a high-speed algorithm to track shapes (mainly circle based) and calculate their barycentres. Based on an existing camera working with infra-red light at 1280x1024 pixels and 150 frames per second, the embedded FPGA acquires the image and find some shapes. This camera is then used for motion tracking and sends through USB, the coordinates of the detected shapes. The algorithm works with low latency and tracks a moving point. It well…

    The goal of this project is to develop on an FPGA, a high-speed algorithm to track shapes (mainly circle based) and calculate their barycentres. Based on an existing camera working with infra-red light at 1280x1024 pixels and 150 frames per second, the embedded FPGA acquires the image and find some shapes. This camera is then used for motion tracking and sends through USB, the coordinates of the detected shapes. The algorithm works with low latency and tracks a moving point. It well differentiates small dots from bigger shapes and detects blinking points. It calculates the blinking frequency of each dot. Algorithms are developed in VHDL, simulated and tested in real condition using the camera and the JTAG debugging interface. Each data extraction is done in less than one frame.

    A current working camera with an FPGA exists. A black and white image is stored into the embedded RAM of the FPGA. The communication with the USB is already implemented with a Qt visual application allowing to capture grey level images and get computed data.

    Technologies/ Methodologies used:

    * CMOS based Camera design using an FPGA
    * FPGA - Xilinx Spartan-6
    * USB
    * Embedded Tracking Algorithms
    * Shape Recognition
    * Blob Detection and Labeling Analyses
    * Shape Barycenter Calculation
    * Qt visual application development

  • Implementation of an Area Efficient 32x32 bit Multiplier based on Multi-Radix Number Systems

    -

    A multiplier unit based on the 7-bit encoding scheme for mixed radix number systems and subsequent implementation on an FPGA

    Other creators
  • VLSI Architecture and FPGA Prototyping of a Digital Camera

    -

    Completed during the internship at HES-SO, Geneva with Johnathan Dustour

    #Prototyped and Manufactured an Infra-Red Camera for converting any surface into a touchscreen

    # VHDL modelling of Data acquisition modules, Clock Management Module, Serializer-Deserializer Module, Image Treatment Module and USB Arbiter Module

  • MetaCORE Development for faster communication between parallel OpenRISC processors

    -

    Development of a glue-logic for a MetaCORE to enable faster communication between two or more parallel OpenRISC processors in HDL

    Other creators
  • Image Segmentation using the Watershed Algorithm on MATLAB

    -

    Digital Signal Processing Mini-Project as part of the Course Curriculum

    Simulation and Processing on MATLAB

  • Performance Analysis of a Mixer Unit using NAND and NOR based ROM with appropriate 3:8 Decoders

    -

    VLSI-Design based mini-project as part of the Course Curriculum

    Layout for the unit was done using the VLSI-CAD, MAGIC tool for Ubuntu
    Circuit level and Logic level simulations on ngSPICE and IrSIM

    Performance Parameters considered: Switching power consumed, Area occupied, Dynamic power loss, Overall size of die for implementation

    Conclusion: NAND based ROM provided better results

    Other creators
  • Autonomous Surveyor Robot using Wireless Networks with Obstacle Avoidance

    -

    A team based embedded systems project dealing with the navigation of a robot to a destination set by the master who communicates with the bot using GSM network while the bot also avoids obstacles in the process; once it reaches the final destination, it captures images using a Raspberry PI, enabling us to have a 360 degree view of the surroundings with respect to the azimuthal plane. The image captured is immediately fed to the master who surveys the location through a Dropbox account for which…

    A team based embedded systems project dealing with the navigation of a robot to a destination set by the master who communicates with the bot using GSM network while the bot also avoids obstacles in the process; once it reaches the final destination, it captures images using a Raspberry PI, enabling us to have a 360 degree view of the surroundings with respect to the azimuthal plane. The image captured is immediately fed to the master who surveys the location through a Dropbox account for which an API was developed.

    Funding from the Department of Electronics and Communication Engineering, NITK and IEEE-NITK Chapter

    Other creators
    See project
  • Mapping Greenhouse gases using Wireless Sensor Networks

    -

    A team venture to design a low cost and competent solution to tackle environmental issues like global warming and increasing pollution levels for further climate processing applications, taken up as a project under the Institution of Electrical and Electronics Engineers Consortium, NITK Chapter

    Other creators
  • Adaptive Actuatable Resource Cataloguing Sensors (ARCSense)

    -

    Wireless water flow monitoring and actuating solution for wide geographical surveys - A team venture to design a competent solution for assessment of water resources for the Texas Instruments Innovation Challenge - 2013/14

    Other creators
  • Inter-Module Communication link for Flight Control Computer Applications

    -

    Development of serial communication protocols between a Virtex-5 FXT FPGA containing the PowerPC440 Processor and multiple ADuC-841 microcontroller boards using the PCI-Express and the Aurora protocol

Honors & Awards

  • Cleared the Placement Exams for EE457 and EE477L

    Ming Hsieh Department of Electrical Engineering, University of Southern California

    Passed the Placement Exams for the courses, EE457 (Computer Systems Organization) and EE477 (CMOS VLSI Design) - both are pre-requisite courses for graduate level courses at USC

  • Recipient of the IEEE Region-10 Student Enterprise Award 2014

    IEEE

    IEEE Region-10 Student Enterprise Award for the project on "Mapping of Greenhouse gases using Wireless Sensor Networks"

  • Bourses d’Excellence de la Confédération Suisse pour Chercheurs

    Federal Government of Switzerland

    Research Grant from the Federal Government of Switzerland for research purposes till August 2014

  • Top-30 Electronics Designer Award

    Electronics for You Magazine

    A certificate of merit in the EFY-Design Contest 2014 for presenting the Autonomous Surveryor Robot

  • Chairman's Award for Academic Excellence

    Dr. Chenraj Jain, Chairman, Jain Group of Institutions

    Cash Award of Rs. 8000/- for coming 2nd at the college level and 7th at the state level in the Pre-University Board Examinations

  • Honorary KCET-Top10 Award

    Late Dr. V S Acharya, the then Minister of Higher Education, Government of Karnataka

  • Chairman's Award for Academic Excellence

    Dr. Chenraj Jain, Chairman, Jain Group of Institutions

    Tuition fee exemption of Rs. 10,000 for topping the 11th Standard Final Examinations at SBMJC, VV Puram, Bangalore

  • Best Outgoing Student

    Dr. H Sudarshan, Karuna Trust and Ms. Vanashree Vipin Singh, IFS

  • Independence Day Scholarship Award

    Principal, Ms. Hemaa Narayan

  • Honorary School Captain

    Principal, Ms. Hemaa Narayan

  • Independence Day Scholarship Award

    Principal, Ms. Hemaa Narayan

  • Honorary School Captain

    Principal, Ms. Hemaa Narayan

Test Scores

  • All India Engineering Entrance Examination

    Score: All India Rank 3329

    Entrance Examination for admission to the prestigious National Institute of Technologies (NITs), Indian Institute of Information Technologies (Design & Manufacturing) and Institute of Mines, Dhanbad

  • IIT - Joint Entrance Examination

    Score: JEE AIR: 7337

    Entrance Examination to the prestigious Indian Institute of Technology (IITs)

  • Karnataka Common Entrance Test

    Score: 167/180; Rank - 7

    Entrance Examination for admission to Engineering Colleges in Karnataka, India

Languages

  • English

    Full professional proficiency

  • Tamil

    Professional working proficiency

  • Kannada

    Professional working proficiency

  • French

    Limited working proficiency

  • Hindi

    Limited working proficiency

Organizations

  • USC Alumni Association

    Alumnus

    - Present
  • NITK Surathkal Alumni Association - Bay Area Chapter

    Alumnus

    - Present
  • World Wildlife Fund

    Contributor

    - Present
  • Institute of Electrical and Electronics Engineers

    Student Member

    -
  • Rotary Club for International Leadership

    Honorary Member

    -

Recommendations received

More activity by Arjun

View Arjun’s full profile

  • See who you know in common
  • Get introduced
  • Contact Arjun directly
Join to view full profile

Other similar profiles

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Others named Arjun Aravind

Add new skills with these courses