Mark Gottscho, Ph.D.

Computer architect

I am currently a Senior Manager at SambaNova Systems, where I founded and continue to direct the compiler operators team. Our responsibilities include development of internal MLIR dialects, formalization of operators and mixed precision semantics, and implementation of high performance operator kernels. We don’t target GPUs, but instead lower to our custom AI accelerator chip that’s called the Reconfigurable Dataflow Unit (RDU). It’s capable of both training and inference. As the compiler technical lead for a future RDU chip, I’m also driving several hardware/software codesign efforts and bootstrapping a new architecture specification workflow.

My interests include AI graph compilation, dataflow architecture, agile hardware development methodologies, memory systems, and hardware reliability.

Prior to my work at SambaNova, I was a chip architect, microarchitect, RTL designer, and cross-functional technical lead in Google’s Tensor Processing Unit (TPU) team. All of my projects at Google successfully landed: TPUv4i, TPUv4, TPUv5e, TPUv5p, and Trillium have all been key technologies underlying Google’s AI leadership. My contributions to the TPU spanned most of the subsystems, although the bulk of my impact was on the memory system, host interface, and inter-chip interconnect; I won a Feat of Engineering for my work on the TPUv4 on-chip interconnect.

Alongside my co-lead, I also landed an internal Chip Development Kit (CDK) from scratch. It is a set of comprehensively specified, parameterizable, and formally verified RTL libraries that implement common dataflow primitives with high performance and minimal area. CDK significantly improved the hardware development velocity and lowered bug rates for several different Google chips (not just TPUs) using a correct-by-construction approach. The vast majority of the gates in recent TPU chips are attributed to CDK modules.

I received my Ph.D. in Electrical Engineering from UCLA in 2017 for my dissertation titled "Opportunistic Memory Systems in Presence of Hardware Variability." My research explored techniques for exploiting inherent imperfections in hardware to improve energy efficiency, reduce cost, and gracefully recover from errors. In 2016, I won the Qualcomm Innovation Fellowship and the UCLA Dissertation Year Fellowship for my co-invention of "Software-Defined Error-Correcting Codes."

I’m a dad, husband, son, brother, uncle, and friend.

Outside of work, I’m a long-time road racing and sports car enthusiast. Whenever not with my family, you are likely to find me road racing in my Spec E46, simracing online, or enjoying Bay Area backroads.

Blog | CV