Created by: ForFishes
This Op computes a batched fully connected (FC) layer.
Notice: It currently supports GPU devices. This Op lives in contrib, which means that it is not exposed as part of the public API.
Dimensions of batch_fc_op:
    X.dim = slot_pairs_num * ins_num * in_dim
    W.dim = slot_pairs_num * in_dim * out_dim
    b.dim = slot_pairs_num * out_dim
    output.dim = slot_pairs_num * ins_num * out_dim
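
The listed dimensions are consistent with one independent matrix multiply plus bias per slot pair. The following is a minimal NumPy sketch of that reading, not the Op's actual kernel: it assumes output[s] = X[s] @ W[s] + b[s] (the formula itself is not stated here), it ignores any activation, and the name batch_fc_reference is hypothetical.

```python
import numpy as np


def batch_fc_reference(x, w, b):
    """Hypothetical CPU reference; assumes out[s] = x[s] @ w[s] + b[s]."""
    slot_pairs_num, ins_num, in_dim = x.shape
    # W must share the leading slot-pair axis and the inner dimension with X.
    assert w.shape[0] == slot_pairs_num and w.shape[1] == in_dim
    out_dim = w.shape[2]
    assert b.shape == (slot_pairs_num, out_dim)
    # np.matmul treats the leading axis as a batch axis; the bias gets a
    # singleton axis so it broadcasts over the ins_num dimension.
    return np.matmul(x, w) + b[:, np.newaxis, :]


rng = np.random.default_rng(0)
x = rng.standard_normal((3, 4, 5))  # slot_pairs_num=3, ins_num=4, in_dim=5
w = rng.standard_normal((3, 5, 2))  # out_dim=2
b = rng.standard_normal((3, 2))
out = batch_fc_reference(x, w, b)
print(out.shape)
```

The result has shape slot_pairs_num * ins_num * out_dim, matching output.dim above. Each slot pair uses its own weight matrix and bias, which is what distinguishes this from a single FC layer applied to a reshaped input.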