yahma
yahma t1_jd6ptt5 wrote
Reply to comment by RedditLovingSun in [P] OpenAssistant is now live on reddit (Open Source ChatGPT alternative) by pixiegirl417
Good point, I forgot about this.
yahma t1_jd24u9z wrote
Would switching the base pythia-12b model for llama-13b improve things?
yahma t1_j7tc4h4 wrote
Reply to comment by YourDadsBoyfriend69 in I asked Microsoft's 'new Bing' to write me a cover letter for a job. It refused, saying this would be 'unethical' and 'unfair to other applicants.' by TopHatSasquatch
Which Chinese one?
yahma t1_j7a3pcn wrote
Reply to [D] Are large language models dangerous? by spiritus_dei
Google wants you to think they are dangerous, so they can stifle the competition by getting regulations and restrictions on AI passed.
yahma t1_j6evm31 wrote
Reply to [P] AI Content Detector by YoutubeStruggle
The problem with many of these AI content detectors is that they too often flag human-written text as AI-generated.
yahma t1_j4owot0 wrote
Reply to comment by MegavirusOfDoom in [D] Fine-tuning open source models on specific tasks to compete with ChatGPT? by jaqws
This may be the size of the datasets, but it's hard to say how many parameters will be needed for a good LLM that's just really good at explaining code.
yahma t1_j2ssc01 wrote
Reply to comment by C0hentheBarbarian in [R] Massive Language Models Can Be Accurately Pruned in One-Shot by starstruckmon
I wasn't very impressed with BLOOMZ. Responses seem short and optimized for Q/A-style output. Perhaps zero-shot and single-shot prompts worked better than with BLOOM, but BLOOM seemed to produce better output for stories or writing in general.
I was only able to test the 6B models though, so not sure how the 176B models compare.
yahma t1_j2ss1ox wrote
So with pruning and 8-bit quantization, are we able to run BLOOM-176B on a single GPU yet?
yahma t1_j1dulgw wrote
Based on my testing, none of the open source models are anywhere near as good as ChatGPT (or even davinci-03, the latest GPT-3 snapshot).
I think open source models need more fine-tuning and some RL techniques applied to get anywhere close.
yahma t1_jdiqmj2 wrote
Reply to comment by Silphendio in [D] What is the best open source chatbot AI to do transfer learning on? by to4life4
Are the results as good as Alpaca with a LLaMA base?