sanman

sanman t1_j1qpdd0 wrote

What exactly is Validation data?
I know what Training data is for, and I know what Testing data is for.

But what is Validation data, and what is Validation for?

1

sanman t1_j1bacsy wrote

Well, remember when Youtube was totally free without any ads whatsoever? And of course we all wondered how they were going to continue offering their service for free. Then one day the ads crept in, and we knew.

I'm thinking OpenAI hasn't made this thing free just for generosity. They're using us as free beta-testers to shake down the product for them, so that they can iron out the kinks and bugs. Once that process has run its course, they'll just cut off our access and only allow paying customers to use it.

13

sanman t1_j1b5eb0 wrote

Is it possible to have Machine Learning for CAD designs? Could it be possible to train a model on a repository of CAD files?

Is highly structured vectorized data more efficient to train on compared to rasterized image repositories? How much more efficient?

1

sanman t1_j0ynjfi wrote

How to Handle Lots of Missing/Null Values in Data?

There's a data set that I've been given to analyze, and it's got a lot of missing data. Typically, I should replace missing values with mean, or mode, etc. But one particular column has nearly 70% null values. What is the threshold to reject a column as unsuitable for analysis, instead of trying to replace those missing values? How large a proportion of missing values is acceptable before I have to reject/discard the column altogether? Is there some rule of thumb for this?

1

sanman t1_ix41yew wrote

What Are Latest Cutting-Edge Applications in Generative Modeling?

Like everyone else, I've been playing with the new release of Stable Diffusion recently, and marveling at its output. I want to know what else is out there that makes use of Generative Modeling. What are the newest and most exciting things in development? I really want to know.

I can already see Generative Modeling being used for music. But beyond just artwork, what are other big fields or practical applications? What about CAD, for example? If a Machine Learning model was trained on enough CAD files of various types, could it learn how to design machinery, equipment, vehicles, buildings, etc? If a Machine Learning model was trained on lots of DNA samples categorically labeled according to their phenotypes, then could it learn how to make living things?

1

sanman t1_iwwys1t wrote

How Can I Boost PyTorch Performance On My Cheap Laptop?

Hi, I have a cheap laptop running Windows 10 and Intel Core I5-5300u processor with Intel 5500 GPU.

How can I find a way to make Pytorch code run faster on it? I tried looking at Intel Extensions for Pytorch, but they say 10th-gen processor is minimum requirement. I read about something called Zluda which claims to be able to run CUDA-enabled code without modifications, but that seems to only be for Linux and I don't know what the minimum processor requirements are.

Is there some way to use my Intel GPU to help out my execution time?

1