• Y
    Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733) · c67c8758
    Yiqun Liu 提交于
    * Refine the codes related to fc op.
    
    * Add GPU implementation for fc functor.
    
    * Apply fc_fuse_pass in GPU inference.
    test=develop
    
    * Change the cmake for fc op.
    
    * Change PADDLE_ENFORCE to PADDLE_ENFORCE_EQ.
    
    * Add an attribute to set the activation type in fc_op.
    
    * Enhance the unittest of fc_op.
    test=develop
    
    * Remove the declaration of FCOpGrad back to the header file.
    test=develop
    
    * Set default value for newly added arguments in test_fc_op.
    test=develop
    
    * Enhance fc_fuse_pass to enable fusing relu.
    
    * Allow print the shapes of var_desc in graph.
    test=develop
    
    * Enhance fc_fuse_pass_tester.
    
    * Remove the use of PADDLE_ENFORCE.
    test=develop
    
    * Correct the number of ops after fusing.
    test=develop
    
    * Fix a typo.
    test=develop
    
    * Set activation_type to null when there is no relu in fc.
    test=develop
    
    * Refine fc_fuse_pass's codes.
    
    * Enable the set of shape for tensor.
    
    * Refine repeated_fc_relu_pass and add unittest.
    test=develop
    c67c8758
graph_pattern_detector.cc 69.9 KB