shanereid1
shanereid1 t1_j3dgdq0 wrote
Do you define transformers as Deep Learning? Cause if not then transformers.
shanereid1 t1_izg2dq6 wrote
Reply to [D] We're the Meta AI research team behind CICERO, the first AI agent to achieve human-level performance in the game Diplomacy. We’ll be answering your questions on December 8th starting at 10am PT. Ask us anything! by MetaAI_Official
What do you guys think is the most difficult game to solve using RL?
shanereid1 t1_iwz1m7p wrote
Reply to comment by saintjimmy43 in TIL in response to infamously high suicide rates at Mapo Bridge in Seoul, South Korea, the bridge was adorned with suicide prevention messages and uplifting photos. These measures weren't enacted by the government, however, instead the entire project was financed by Samsung's life insurance division by evilclownattack
Unfortunately there have been a number of studies that have shown that this type of thing actually increases the number of deaths rather than prevents them. The theory goes that if someone is in a negative headspace but they haven't considered suicide, seeing this type of message can put the idea in their head, leading to their deaths.Even just placing flowers at the spot where someone jumped can cause a spike in the number of deaths. This is why the news no longer reports that someone committed suicide but rather that they just died suddenly. Unfortunately it's incredibly difficult to tell someone that their well meaning idea is actually causing more harm, and that they shouldn't place flowers to commemorate their loved ones.
shanereid1 t1_jdlt38a wrote
Reply to [D] Do we really need 100B+ parameters in a large language model? by Vegetable-Skill-9700
Have you read about the lotto ticket hypothesis? It was a paper from a few years ago that showed that within a fully connected neural network there exists a smaller sub network that can perform equally as well, even when the subnetwork is as low as a few % of the size of the original network. AFAIK they only proved this for MLP and CNNs. Its almost certain that the power of these LLMs can be distilled in some fashion without significantly degrading performance.