We implemeted 4 models for now: the convolutional deep-structured semantic model (CDSSM, CNN-based), the ___Infer Sent Model___ (RNN-based), the shortcut-stacked encoder (SSE, RNN-based), and the decomposed attention model (DecAtt, attention-based).
We implemeted 4 models for now: the convolutional deep-structured semantic model (CDSSM, CNN-based), the InferSent model (RNN-based), the shortcut-stacked encoder (SSE, RNN-based), and the decomposed attention model (DecAtt, attention-based).
@@ -176,8 +176,6 @@ We implemeted 4 models for now: the convolutional deep-structured semantic model
...
@@ -176,8 +176,6 @@ We implemeted 4 models for now: the convolutional deep-structured semantic model
## Results
## Results
In our experiment, we found that LSTM-based models outperformed convolution-based models. The DecAtt model has fewer parameters than LSTM-based models, but is sensitive to hyper-parameters.
|Model|Config|dev accuracy| test accuracy
|Model|Config|dev accuracy| test accuracy
|:----:|:----:|:----:|:----:|
|:----:|:----:|:----:|:----:|
|cdssmNet|cdssm_base|83.56%|82.83%|
|cdssmNet|cdssm_base|83.56%|82.83%|
...
@@ -185,8 +183,9 @@ In our experiment, we found that LSTM-based models outperformed convolution-base
...
@@ -185,8 +183,9 @@ In our experiment, we found that LSTM-based models outperformed convolution-base
|InferSentNet|infer_sent_v1|87.15%|86.62%|
|InferSentNet|infer_sent_v1|87.15%|86.62%|
|InferSentNet|infer_sent_v2|88.55%|88.43%|
|InferSentNet|infer_sent_v2|88.55%|88.43%|
|SSENet|sse_base|88.35%|88.25%|
|SSENet|sse_base|88.35%|88.25%|
In our experiment, we found that LSTM-based models outperformed convolution-based models. The DecAtt model has fewer parameters than LSTM-based models, but is sensitive to hyper-parameters.