<span class="c1"># If the data gathered by the collector must contain a logit key reflecting the action probabilities, set this key to True.</span>
<span class="c1"># Guided Cost Learning uses the logit to train the reward model, so the key is set to True here.</span>
<span class="c1"># collector_logit defaults to False.</span>