Type something to search...

Programming

 From 5 Seconds to 480ms: A Performance Optimization Journey

From 5 Seconds to 480ms: A Performance Optimization Journey

The goal was simple: read a CSV of 10 million trades, aggregate them into OHLCV bars (Open/High/Low/Close/Volume) by time window, and do it fast in C++ and then Rust.

Read More
CLion with multiple CUDA SDKs

CLion with multiple CUDA SDKs

An article illustrating a way to define multiple toolchain in CLion, each with a different version of the CUDA SDK...

Read More
Dali operator for multi-page TIFF's

Dali operator for multi-page TIFF's

This article explains how I wrote a simple C++ Dali operator to load a multichannel TIFF...

Read More
Homography for tensorflow

Homography for tensorflow

An article about how to implement an homography function in Tensorflow 1.x...

Read More