Submitted by Devinco001 t3_yvngga in MachineLearning
Devinco001 OP t1_iwgaljf wrote
Reply to comment by cautioushedonist in [D] Phonetic Algorithm Spellcheck Metric by Devinco001
Yes, for example if I have a word 'baend' and I make it go through soundex + levenshtein, it gives me 'band' and 'bend', both with a distance of 1. So I want to basically decide which of the words would be a better choice.
Yes, the LM idea is awesome. But I am a bit low on memory and disk space. On hugging face, the LM which pops up for filling mask is quite large, with significant computational time.
Can this be done without LM, like some frequency tables, etc.? Or is there an LM sort of thing where I can input the highest ranked soundex words and get the confidence score for each? Or is there an optimized LM for this task, I tried finding it but didn't get one till now.
cautioushedonist t1_iwgzytj wrote
- Can you confirm if the example on this webpage doesn't work for you in terms of size and memory?
Example is under How to use on
https://huggingface.co/distilbert-base-uncased?text=The+goal+of+life+is+%5BMASK%5D
- Now, if you're open to paying for GPT3 services then this answer might be helpful.
https://stackoverflow.com/questions/73370817/how-to-use-gpt-3-for-fill-mask-tasks
This will be API calls and so you wouldn't need to worry about inference times and sizes.
So, you can either find the smallest LM possible that can work with fill-mask or use some API service to get around size/memory bottlenecks.
Devinco001 OP t1_iwmaat8 wrote
I actually saw this very example first, yeah it requires a good amount of computational power which my pc currently lacks. API calls I can but there would be rate limits to it, which needs to be payed to extend usage, that is why I have to drop that approach
I was actually looking for a non language model based approach for now, since language models are computation heavy. I am currently going to use symspell python library, since it is faster, though less accurate. Once I increase my Ram, I will surely start using LM since these are far better in accuracy. Thanks
Paid-Not-Payed-Bot t1_iwmabtx wrote
> to be paid to extend
FTFY.
Although payed exists (the reason why autocorrection didn't help you), it is only correct in:
-
Nautical context, when it means to paint a surface, or to cover with something like tar or resin in order to make it waterproof or corrosion-resistant. The deck is yet to be payed.
-
Payed out when letting strings, cables or ropes out, by slacking them. The rope is payed out! You can pull now.
Unfortunately, I was unable to find nautical or rope-related words in your comment.
Beep, boop, I'm a bot
Viewing a single comment thread. View all comments