Submitted by mjrossman t3_107i7yv in Futurology
Helpful_Opinion2023 t1_j3qavkk wrote
Reply to comment by mjrossman in A Singular Trajectory: the Signs of AGI by mjrossman
Is there a good starting point for a layperson to learn about the fundamentals of AI/ML/etc?
I come from a business/finance background, with an undergrad in business admin that was a decade ago. Do I need to basically learn computer science and programming concepts from scratch? Or is there a way to onboard myself to grasping the concepts of AI without basically restarting my education from scratch?
icedrift t1_j3qyl1w wrote
I gotchu. First thing you need to do is learn Python. You don't need to be a master by any means but you should understand variables, expressions, functions, classes, packages/dependencies, file systems, and basic algebra. Run through this amazing book and you'll understand plenty to get into the ML side of things.
Once you know a bit of Python complete this course Practical Machine Learning for Coders. This is an extremely highly regarded modern crash course to machine learning that is bringing a lot new people into the industry. In the very first lesson you'll build an image classifier that didn't even exist 5 years ago.
As you go deeper and deeper Math becomes more important but CS isn't really necessary.
unholyravenger t1_j3rwzvs wrote
For a conceptual understanding start with 3b1b . One of the best explanations of the underlying concepts I've seen. This is really the foundation of everything.
Next, there are 2 main concepts to understand, and that is how each layer of a NN works and the overall architecture. A quick list of layers to get your head around: Linear also called Multilayer Perceptron (MLP), CNN Convolutional Neural Network, then you have a family of layers that handles sequences like sentences. These are RNN, LSTM, and Transformers. But all of these are built on the same concepts as the 3b1b videos. If you're more of a math person this is a great way to conceptualize what each of these layers is doing.
Next different architectures. Start will simple classifiers, which you should already have a good understanding of. Then check out how GAN's work and how you can use two networks to train each other. Then maybe you can go to the state of the art with Diffusion networks. I think this is a bit easier to understand than how each layer works.
All the while playing around in python, and prepackaged ML stuff to apply your knowledge to something concrete. Make a simple classifier, download, and fine-tune a diffusion network on some dataset. Coursera has some really good classes, particularly by Andrew NG who is one of the biggest ML educators out there.
​
Bonus Resources:
ML Streat Talk: Podcast talking to people in the industry. Lots of deep concepts here.
2min Papers The hype channel. Learn about all the new stuff coming out.
Yannic Kilcher: Go deep into different papers written on ML and how they work at a very deep level.
​
Good luck it's a lot, but no one knows everything and you need surprisingly little to get started.
Viewing a single comment thread. View all comments