Submitted by rezayazdanfar t3_11qfl2o in deeplearning
rezayazdanfar OP t1_jcu24bm wrote
Reply to comment by DeepLearningStudent in How To Scale Transformers’ Memory up to 262K Tokens With a Minor Change? by rezayazdanfar
:) happy to hear it, hope you found it practical in your work. I also aim to use it in my future project. :)
Viewing a single comment thread. View all comments