• Y
    Implement the GPU kernel of fc operator (#19687) · a65c728e
    Yiqun Liu 提交于
    * Refine the codes related to fc op.
    
    * Add GPU implementation for fc functor.
    
    * Apply fc_fuse_pass in GPU inference.
    test=develop
    
    * Change the cmake for fc op.
    
    * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
    
    * Add an attribute to set the activation type in fc_op.
    
    * Enhance the unittest of fc_op.
    test=develop
    
    * Remove the declaration of FCOpGrad back to the header file.
    test=develop
    
    * Set default value for newly added arguments in test_fc_op.
    test=develop
    a65c728e
fc_op.cu.cc 831 字节