Recent comments in /f/MachineLearning
nateharada t1_jeh5bir wrote
Reply to comment by lacker in [D][N] LAION Launches Petition to Establish an International Publicly Funded Supercomputing Facility for Open Source Large-scale AI Research and its Safety by stringShuffle
I personally feel we need large scale collaboration, not each lab having a small increase. Something like a James Webb telescope or a CERN. If they make a large cluster that's just time shared between labs that's not as useful IMO as allowing many universities to collaborate on a truly public LLM that competes with the biggest private AI organizations.
sEi_ t1_jeh4rid wrote
Reply to comment by Scew in [D][N] LAION Launches Petition to Establish an International Publicly Funded Supercomputing Facility for Open Source Large-scale AI Research and its Safety by stringShuffle
Is that you Chad?
master3243 t1_jeh48sn wrote
Reply to comment by [deleted] in [News] Twitter algorithm now open source by John-The-Bomb-2
I don't take any CEO's words at face value without considering the monetary values and incentives behind that tongue.
A large project like this being open-sourced, even if it's a very old or heavily stripped down version, is always a great thing for the community.
FinancialElephant t1_jeh33j9 wrote
Reply to comment by Erosis in [News] Twitter algorithm now open source by John-The-Bomb-2
Most infrastructure code like computer vision code, device drivers, etc are either not culturally relevant or have little cultural relevance.
I don't think it makes any sense to prioritize them when things like twitter have much more direct cultural impact. It would be great if my network card driver was open source, but does it really matter? Is it worth prioritizing? Will it likely have any cultural relevance? To most people the answer to all these questions is no.
codingwoman_ t1_jeh31os wrote
Reply to comment by midnitte in [News] Twitter algorithm now open source by John-The-Bomb-2
I'm still able to access this link though, even on private browser
i_use_3_seashells t1_jeh2wih wrote
Reply to comment by AsAnAILanguageModel_ in [News] Twitter algorithm now open source by John-The-Bomb-2
Then who did, if not the owner/CEO
ninjasaid13 t1_jeh2s4o wrote
Reply to comment by TitusPullo4 in [R] The Debate Over Understanding in AI’s Large Language Models by currentscurrents
>Consciousness is having a subjective experience.
and what's the definition of subjective?
AsAnAILanguageModel_ t1_jeh2owx wrote
Reply to comment by Educational-Net303 in [News] Twitter algorithm now open source by John-The-Bomb-2
Elon didn’t open source it.
midnitte t1_jeh2jjj wrote
Reply to comment by Necessary-Meringue-1 in [News] Twitter algorithm now open source by John-The-Bomb-2
>An aside, if you want a chuckle, search the term "Elon" in the repo:https://github.com/twitter/the-algorithm/search?q=elon
codingwoman_ t1_jeh2iw5 wrote
Reply to comment by midnitte in [News] Twitter algorithm now open source by John-The-Bomb-2
Well devil is in the detail, don't miss the fun part in commit messages :)
Please note we have force-pushed a new initial commit in order to remove some publicly-available Twitter user information. Note that this process may be required in the future.
midasp t1_jeh2awl wrote
It's kinda nice to see PageRank is still being used as one of the components of the algorithm
midnitte t1_jeh1mia wrote
Reply to comment by codingwoman_ in [News] Twitter algorithm now open source by John-The-Bomb-2
Seems to be deleted now, which wouldn't be surprising...
[deleted] t1_jeh1lzg wrote
Reply to comment by [deleted] in [News] Twitter algorithm now open source by John-The-Bomb-2
[removed]
ZestyData t1_jeh198p wrote
Reply to comment by lordofbitterdrinks in [News] Twitter algorithm now open source by John-The-Bomb-2
This quite obviously isn't the repo used by twitter.
It is a pretty large and well put together documentation epic & consolidation of multiple microservices.
Whether the content is 100% reflective of whats deployed is completely unclear. But its not "fake" that's for sure, its genuinely too many man-years of work to not be in-essence real.
ZestyData t1_jeh12gm wrote
Reply to comment by pier4r in [News] Twitter algorithm now open source by John-The-Bomb-2
Idk man as a fairly well seasoned MLE I find their general architecture and scale of their combined models to be fascinating in-and-of itself.
Twitter sucks ass - but this is a beautiful piece of ML Engineering.
PassingTumbleweed t1_jeh1248 wrote
Reply to comment by KD_A in [P] CAPPr: use OpenAI or HuggingFace models to easily do zero-shot text classification by KD_A
One thing I've seen with these LLMs is that you can prompt them with the classes using sort of a multiple choice style. It would be interesting to experiment with whether this can stabilize the outputs and reduce the amount of out of vocabulary predictions you get
KD_A OP t1_jeh0ygl wrote
Reply to comment by PassingTumbleweed in [P] CAPPr: use OpenAI or HuggingFace models to easily do zero-shot text classification by KD_A
Yup it gets totally discarded. Hopefully, the conditional probability of bird is higher than cat or dog.
fool126 t1_jeh0t01 wrote
Reply to [D] Simple Questions Thread by AutoModerator
What are the dominant methods for solving contextual bandit problems?
[deleted] t1_jeh0smk wrote
Reply to comment by londons_explorer in [News] Twitter algorithm now open source by John-The-Bomb-2
[removed]
PassingTumbleweed t1_jeh0p1j wrote
Reply to comment by KD_A in [P] CAPPr: use OpenAI or HuggingFace models to easily do zero-shot text classification by KD_A
I'm curious to get your thoughts about a simple example where you have three classes: cat, dog, and bird. What happens if the top-1 prediction is "eagle"? Does that probability mass get discarded? Because it should probably go into the bird category
HelloItMeMort t1_jeh0mbd wrote
Reply to comment by codingwoman_ in [News] Twitter algorithm now open source by John-The-Bomb-2
He can’t accept that nobody wants him on their timeline
Necessary-Meringue-1 t1_jegz6md wrote
Reply to comment by t98907 in [News] Twitter algorithm now open source by John-The-Bomb-2
I think we can safely go with Occam's Razor here. I would assume the "influential celebrity" is the "power_user" type, see: https://i.imgur.com/s6ntUil.png
​
Either way, I'm not surprised they are giving tweets from Musk their own type. Why wouldn't they. It probably became necessary to deal with his antics.
t98907 t1_jegy79j wrote
Reply to comment by Necessary-Meringue-1 in [News] Twitter algorithm now open source by John-The-Bomb-2
However, it does not seem to affect the recommendation algorithm.
FermiAnyon t1_jeh5jr0 wrote
Reply to comment by Ricenaros in [D] Turns out, Othello-GPT does have a world model. by Desi___Gigachad
Yeah, I was just saying it's a limited number and that the specific number isn't important. The important thing is that there a limited number. That doesn't imply anything about infinity except that infinity is off the table as an option.