The articles below cover various aspects of writing performant code.
- Profiling
- Memory Management
- Caching Data For Optimal Performance
- Threading, Synchronization And Parallelism
- Analyzing GC logs
Here is a link to a GitHub repository where I have included several examples of concurrency in C++, including thread-safe containers, atomic operations, barriers, latches, parallel algorithms such as parallel quicksort, efficient parallel operations that use memory orderings weaker than sequential consistency, such as release-consume and release-acquire, and more.
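To give a flavor of the release-acquire ordering mentioned above, here is a minimal sketch. It is not taken from the repository; the function name `run_release_acquire_demo` and the value 42 are made up for illustration. One thread writes a plain variable and then sets an atomic flag with release ordering, while another thread waits on the flag with acquire ordering before reading the variable.

```cpp
#include <atomic>
#include <thread>

// Hypothetical demo (not from the repository): hands a value from a
// producer thread to a consumer thread using release/acquire ordering
// instead of the default sequentially consistent ordering.
int run_release_acquire_demo() {
    int payload = 0;                 // plain, non-atomic data
    std::atomic<bool> ready{false};  // flag that guards the payload
    int observed = -1;

    std::thread producer([&] {
        payload = 42;                                  // 1. write the data
        ready.store(true, std::memory_order_release);  // 2. publish it
    });

    std::thread consumer([&] {
        // Wait until the flag is set. The acquire load pairs with the
        // release store, so the write to payload is visible afterwards.
        while (!ready.load(std::memory_order_acquire)) {
            std::this_thread::yield();
        }
        observed = payload;
    });

    producer.join();
    consumer.join();
    return observed;
}
```

Calling `run_release_acquire_demo()` from `main` should return 42. The non-atomic `payload` is safe to read here only because the acquire load observed the release store; with `std::memory_order_relaxed` on both sides the same code would contain a data race.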