Created by: zhiqiu
New features
OPs
add check for bernoulli cuda kernel
register bool for unsqueeze kernel
refine doc of paddle.stack