Fluid performance tuning plan
Created by: reyoung
We plan to tune fluid's performance with a loop with three steps:
- Profile: To figure out which part of the fluid is slow.
- Find problems & Give a fix: We will discuss and find the problems based on profile results.
- Profile: To confirm the problems has been solved and the performance is improved.
There are several jobs for these three steps:
- Find a machine with docker and GPU for profiling. @jacquesqiao
- Neural network configurations for CNN, LSTM, etc. @qingqing01 @dzhwinter
- Setup an environment for profiling. @chengduoZH
- Use cProfile for Python, yap for Python/C++, nvprof for CUDA
- Find problems: All members together.
- Fix GPU problems: @jacquesqiao @qingqing01
- Fix CPU problems: @dzhwinter
- Fix Python problems: TODO