TBPTT: how RNNs deal with the excessive memory needed to store activations for very long sequences
Original paper: [Williams, Ronald J., and Jing Peng. "An efficient gradient-based algorithm for on-line training of recurrent network trajectories."]
TBPTT: Truncated Backpropagation Through Time
In TBPTT, each time we process one...
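To make the memory argument concrete, here is a minimal sketch of the common chunked form of TBPTT. It rests on assumptions that are not in the original text: a scalar RNN `h_t = tanh(w * h_{t-1} + u * x_t)`, a squared-error loss at every step, and a single chunk length `k` used both as the stride and as the truncation depth. The function name `tbptt_grads` and all parameter names are hypothetical, chosen only for illustration.

```python
import math


def tbptt_grads(xs, ys, w, u, k):
    """Per-chunk gradients (dw, du) for a scalar RNN under chunked TBPTT.

    Assumed model: h_t = tanh(w * h_{t-1} + u * x_t),
    assumed loss:  sum_t 0.5 * (h_t - y_t) ** 2.
    Backpropagation stops at each chunk boundary.
    """
    h = 0.0            # hidden state carried across chunks as a constant
    chunk_grads = []
    max_stored = 0     # largest number of activations held at once
    for start in range(0, len(xs), k):
        x_chunk = xs[start:start + k]
        y_chunk = ys[start:start + k]

        # Forward pass: store activations for this chunk only.
        hs = [h]
        for x in x_chunk:
            hs.append(math.tanh(w * hs[-1] + u * x))
        max_stored = max(max_stored, len(hs))

        # Backward pass: runs over this chunk and no further back.
        dw = du = 0.0
        dh_next = 0.0  # gradient arriving from the later step in the chunk
        for t in reversed(range(len(x_chunk))):
            dh = (hs[t + 1] - y_chunk[t]) + dh_next
            da = dh * (1.0 - hs[t + 1] ** 2)  # through tanh
            dw += da * hs[t]
            du += da * x_chunk[t]
            dh_next = da * w
        chunk_grads.append((dw, du))

        # Carry the value forward; the stored activations are dropped here.
        h = hs[-1]
    return chunk_grads, max_stored


if __name__ == "__main__":
    xs = [0.5, -0.1, 0.3, 0.8, -0.4, 0.2]
    ys = [0.1, 0.0, 0.2, 0.5, -0.2, 0.1]
    grads, stored = tbptt_grads(xs, ys, w=0.7, u=0.3, k=2)
    print(len(grads), "chunks, at most", stored, "activations stored")
```

In this sketch the number of stored activations is bounded by `k + 1` regardless of sequence length, at the cost of ignoring dependencies that reach back further than one chunk. Setting `k` to the full sequence length recovers ordinary BPTT for this toy model.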