Comparative Performance Evaluation of Multi-GPU MLFMM Implementation for 2-D VIE Problems


Scalable Parallel DBIM Solutions of Inverse-Scattering Problems


Thoughts on Massively-Parallel Heterogeneous Computing for Solving Large Problems


Large Inverse-Scattering Solutions with DBIM on GPU-Enabled Supercomputers


Adaptive Cache Bypass and Insertion for Many-Core Accelerators



GPU Neural Network for GPGPUSim

A from-scratch feed-forward network in CUDA 4.0 suitable for GPGPUSim


Docker images with LaTeX

ECE408 / CS483 Course Development

Students add a convolution layer to MXNet


Generate PDF, DOCX, HTML, and TXT versions of a resume/CV from a single Markdown source.

Cognitive Application Builder


High-Performance Application Studies

Tools and Techniques for Code Acceleration


An LLVM Version Manager

Positions and Experience


  • Summer 2017 - Research Intern for Optimized CLOUD Systems, IBM TJ Watson Research Center, Yorktown Heights, NY
  • Summer 2014, Summer 2015 - Research Intern, MulticoreWare Inc., Champaign, IL
  • Summer 2013 - Co-op Engineer Floating-Point RTL, AMD, Fort Collins, CO
  • Summer 2012 - Co-op Engineer Physical Design, AMD, Fort Collins, CO


I have been a teaching assistant for the following courses:

  • ECE408/CS483: Heterogeneous Parallel Programming at the University of Illinois
  • E155: Microprocessor-based Systems: Design & Applications at Harvey Mudd College
  • E85: Digital Electronics and Computer Architecture at Harvey Mudd College

I have also been a teaching assistant for the Programming and Tuning Massively Parallel Systems (PUMPS) summer school in Barcelona since 2014.

Recent & Upcoming Talks

Comparative Performance Evaluation of Multi-GPU MLFMM Implementation for 2-D VIE Problems
Fri, Jun 23, 2017
RAI: A Scalable Submission System for GPU Applications
Mon, May 8, 2017
GPU Performance Nuggets
Wed, Jun 15, 2016

Awards and Recognition

Mavis Future Faculty Fellowship - UIUC 2017-2018

Top-20 Poster - 2017 NVIDIA GPU Technology Conference

Teacher Ranked as Excellent by Students - UIUC Fall 2015


Web-based method for physical object delivery through use of 3D printing technology

United States 20140122579

Filed November 1, 2012


Board of Governors, University YMCA.

Executive Board, Amnesty International at UIUC.

Recent Posts

I manage the two Minsky machines available to the C3SR center at Illinois.

Minsky Machine Overview

  • Product: IBM S822LC
  • Model: 8335-GTB
  • CPU: 2x Power8
  • GPU: 4x NVIDIA P100 with 16 GB RAM
  • RAM: 512 GB

Each P8 CPU has 10 cores with 8-way SMT, yielding 80 threads per CPU or 160 threads on each Minsky machine.


I’m helping teach the Programming and tUning Massively Parallel Systems (PUMPS) summer school, hosted by the Barcelona Supercomputing Center at UPC in Barcelona, Spain!


I’m attending CEM 17, hosted at UPC in Barcelona, Spain!


I’ve made my first trip to NVIDIA’s GPU Technology Conference this year, to present some work with my collaborators Abdul Dakkak and Cheng Li. I’ve wanted to attend GTC ever since my first year in the IMPACT group, so this is an exciting trip for me!



  • 222 Coordinated Science Lab, 1308 W. Main St., Urbana, Illinois 61801
  • Face-to-face by appointment