adam_jc
adam_jc t1_irwh173 wrote
Reply to comment by MohamedRashad in [D] Reversing Image-to-text models to get the prompt by MohamedRashad
there is a version on Replicate you can try easily
adam_jc t1_iqui2g0 wrote
Reply to [P] Small problems to test out transformers? by sharp7
you can do n-digit addition of positive integers as a sequence where each digit is a token, i.e.
the problem 946 + 82 = 1028 could be made into sequence of:
9 | 4 | 6 | + | 0 | 8 | 2 | = | 1 | 0 | 2 | 8
(you could also omit + and = tokens).
Andrej Karpathy uses this task in his minGPT repo.
edit: also in that repo he does character level training on a tiny dataset of Shakespeare writing
adam_jc t1_j6eqggn wrote
Reply to comment by Red-Portal in [D] AI Theory - Signal Processing? by a_khalid1999
That’s a great paper! I like this paper too that looks at ViT’s through a signal processing lens and points out some potential flaws in the architecture for vision applications