Stanford, California, United States
10K followers 500+ connections

About

Christos Kozyrakis is a Professor of Electrical Engineering and Computer Science at…

Experience & Education

  • Stanford University

Publications

  • Convolution engine: balancing efficiency & flexibility in specialized computing

    Proceedings of the 40th Annual International Symposium on Computer Architecture / ACM

  • Towards Energy-Proportional Datacenter Memory with Mobile DRAM

    Proceedings of the 39th Intl. Symposium on Computer Architecture

  • Decoupling Datacenter Studies from Access to Large-Scale Applications: A Modeling Approach for Storage Workloads

    IEEE International Symposium on Workload Characterization (IISWC)

    We propose a modeling and characterization framework for large-scale storage applications. As part of this framework we use a state diagram-based storage model, extend it to a hierarchical representation and implement a tool that consistently recreates I/O loads of DC applications. We present the principal features of the framework that allow accurate modeling and generation of storage workloads and the validation process performed against ten original DC application traces. Furthermore, using our framework, we perform an in-depth, per-thread characterization of these applications and provide insights on their behavior. Finally, we explore two practical applications of this methodology: SSD caching and defragmentation benefits on enterprise storage. In both cases we observe significant speedup for most of the examined applications. Since knowledge of the workload’s spatial and temporal locality is necessary to model these use cases, our framework was instrumental in quantifying their performance benefits. The proposed methodology provides a detailed understanding on the storage activity of large-scale applications and enables a wide spectrum of storage studies without the requirement for access to real applications and full application deployment.

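    The state diagram-based storage model described in the abstract above can be illustrated with a minimal sketch: a Markov chain over I/O states whose random walk emits a synthetic request trace. The state names and transition probabilities below are illustrative assumptions, not details taken from the paper.

    ```python
    import random

    # Hypothetical state diagram for an I/O workload generator.
    # Each state maps to the probabilities of moving to the next state.
    STATES = ["seq_read", "rand_read", "seq_write", "idle"]
    TRANSITIONS = {
        "seq_read":  {"seq_read": 0.7, "rand_read": 0.1, "seq_write": 0.1, "idle": 0.1},
        "rand_read": {"seq_read": 0.2, "rand_read": 0.6, "seq_write": 0.1, "idle": 0.1},
        "seq_write": {"seq_read": 0.1, "rand_read": 0.1, "seq_write": 0.7, "idle": 0.1},
        "idle":      {"seq_read": 0.3, "rand_read": 0.3, "seq_write": 0.3, "idle": 0.1},
    }

    def generate_trace(n_requests, seed=42):
        """Walk the state diagram and emit a synthetic sequence of I/O states."""
        rng = random.Random(seed)
        state = "idle"
        trace = []
        for _ in range(n_requests):
            probs = TRANSITIONS[state]
            # Pick the next state according to the outgoing edge weights.
            state = rng.choices(list(probs), weights=list(probs.values()))[0]
            trace.append(state)
        return trace

    trace = generate_trace(1000)
    ```

    A hierarchical version, as in the paper, would nest such diagrams (e.g. per-thread sub-models inside an application-level model); this sketch shows only a single flat level.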
  • Accurate Modeling and Generation of Storage I/O for Datacenter workloads

    Exascale Evaluation and Research Techniques Workshop (EXERT 2011)

    Tools that confidently recreate I/O workloads have become a critical requirement in designing efficient storage systems for datacenters (DCs), since potential inefficiencies get aggregated over several thousand servers. Designing performance, power and cost optimized systems requires a deep understanding of target workloads, and mechanisms to effectively model different design choices. Traditional benchmarking is invalid in cloud datastores, representative storage profiles are hard to obtain, while replaying the entire application in all storage configurations is impractical. Despite these issues, current workload generators are not comprehensive enough to accurately reproduce key aspects of real application patterns. Some of these features include spatial and temporal locality, as well as tuning the intensity of the workload to emulate different storage system behaviors. To address these limitations, we use a state diagram-based storage model, extend it to a hierarchical representation and implement a tool that consistently recreates I/O loads of DC applications. We present the design of the tool and the validation process performed against six original DC application traces. We explore the practical applications of this methodology in two important storage challenges: 1) SSD caching and 2) defragmentation benefits on enterprise storage. In both cases we observe significant storage speedup for most of the DC applications. Since knowledge of the workload's spatial locality is necessary to model these use cases, our tool was instrumental in quantifying their performance benefits.

  • Server Engineering Insights for Large Scale Online Services

    IEEE Micro Issue on Datacenter Computing, Vol. 30, No. 4

    The rapid growth of online services in the last decade has led to the development of large datacenters to host these workloads. These large-scale, online, user-facing services have unique engineering and capacity provisioning design requirements. The authors explore these requirements, focusing on systems balancing, the impact of technology trends, and the challenges of online services workloads.

  • Eigenbench: A Simple Exploration Tool for Orthogonal TM Characteristics

    IISWC'10 (best paper award)

  • Comparing memory systems for chip multiprocessors

    Proceedings of the 34th annual international symposium on Computer architecture (ISCA '07)

    There are two basic models for the on-chip memory in CMP systems: hardware-managed coherent caches and software-managed streaming memory. This paper performs a direct comparison of the two models under the same set of assumptions about technology, area, and computational capabilities. The goal is to quantify how and when they differ in terms of performance, energy consumption, bandwidth requirements, and latency tolerance for general-purpose CMPs. We demonstrate that for data-parallel applications, the cache-based and streaming models perform and scale equally well. For certain applications with little data reuse, streaming scales better due to better bandwidth use and macroscopic software prefetching. However, the introduction of techniques such as hardware prefetching and non-allocating stores to the cache-based model eliminates the streaming advantage. Overall, our results indicate that there is not sufficient advantage in building streaming memory systems where all on-chip memory structures are explicitly managed. On the other hand, we show that streaming at the programming model level is particularly beneficial, even with the cache-based model, as it enhances locality and creates opportunities for bandwidth optimizations. Moreover, we observe that stream programming is actually easier with the cache-based model because the hardware guarantees correct, best-effort execution even when the programmer cannot fully regularize an application's code.

  • Evaluating MapReduce for Multi-core and Multiprocessor Systems

    Won best paper award at the IEEE High Performance Computer Architecture conference


Patents

  • Low power programmable image processor

    Issued US 14/492,535

    A convolution image processor includes a load and store unit, a shift register unit, and a mapping unit. The load and store unit is configured to load and store image pixel data and allow for unaligned access of the image pixel data. The shift register is configured to load and store at least a portion of the image pixel data from the load and store unit and concurrently provide access to each image pixel value in the portion of the image pixel data. The mapping unit is configured to generate a number of shifted versions of image pixel data and corresponding stencil data from the portion of the image pixel data, and concurrently perform one or more operations on each image pixel value in the shifted versions of the portion of the image pixel data and a corresponding stencil value in the corresponding stencil data.

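    The shifted-window multiply-accumulate that the patent's mapping unit performs in hardware corresponds, in software terms, to a 2D stencil convolution. The sketch below is a minimal, illustrative Python reference implementation, not the patented hardware design; leaving border pixels at zero is an assumption for simplicity.

    ```python
    def convolve2d(image, stencil):
        """Naive 2D stencil convolution over a row-major list-of-lists image.

        For each interior pixel, multiply the surrounding window by the
        stencil values and accumulate -- the operation the mapping unit
        performs concurrently across shifted versions of the pixel data.
        Border pixels (where the window would fall off the image) stay 0.
        """
        h, w = len(image), len(image[0])
        k = len(stencil)          # stencil is assumed square, odd-sized
        r = k // 2                # window radius
        out = [[0] * w for _ in range(h)]
        for y in range(r, h - r):
            for x in range(r, w - r):
                acc = 0
                for dy in range(k):
                    for dx in range(k):
                        acc += image[y + dy - r][x + dx - r] * stencil[dy][dx]
                out[y][x] = acc
        return out
    ```

    The hardware version avoids the inner loops entirely: the shift register exposes all window positions at once, and the mapping unit evaluates the multiply-accumulates in parallel.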
