Submitted by TobusFire t3_11fil25 in MachineLearning

Seems like the common thinking these days is that genetic algorithms have extremely limited use cases, and even in those cases they are usually very slow.

My thoughts are that designing an experiment for a genetic algorithm already requires enough of a prior on the environment and the possible mutations that it's probably easier to just use another approach. I'm no expert, but I am interested to hear others' thoughts on whether there are valid use cases outside of pure interest and having fun with evolution.

73

Comments


sugar_scoot t1_jajjyiq wrote

I'm not an expert, but I believe the use case is an environment where you have no gradient to learn from, or not even the hope of approximating a gradient to learn from.

97

pnkdjanh t1_jajp9k8 wrote

I believe genetic algorithms find their use in optimising emergent behaviour - in a biological analogy, if a NN is akin to evolving a brain, then a GA would be like evolving a colony / society.

−2

currentscurrents t1_jajpjj7 wrote

It's not dead, but gradient-based optimization is more popular right now because it works so well for neural networks.

But you can't always use gradient descent. Backprop requires access to the inner workings of the function, and requires that it be smoothly differentiable. Even if you can use it, it may not find a good solution if your loss landscape has a lot of bad local minima.

Evolution is widely used in combinatorial optimization problems, where you're trying to determine the best order of a fixed number of elements.
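
As a hedged sketch of what this looks like (a toy example, not from the thread - the city coordinates, population size, and rates are all made up for illustration): a tiny GA evolving the visiting order of a handful of 2-D points, with order crossover and swap mutation.

```python
import random

# Toy TSP instance: find the best visiting order of these points.
CITIES = [(0, 0), (1, 5), (5, 2), (6, 6), (8, 3), (2, 8)]

def tour_length(order):
    # Total closed-tour Euclidean distance.
    n = len(order)
    total = 0.0
    for i in range(n):
        (x1, y1), (x2, y2) = CITIES[order[i]], CITIES[order[(i + 1) % n]]
        total += ((x1 - x2) ** 2 + (y1 - y2) ** 2) ** 0.5
    return total

def order_crossover(p1, p2):
    # OX: copy a slice from parent 1, fill the rest in parent 2's order.
    n = len(p1)
    a, b = sorted(random.sample(range(n), 2))
    child = [None] * n
    child[a:b] = p1[a:b]
    fill = [c for c in p2 if c not in child]
    for i in range(n):
        if child[i] is None:
            child[i] = fill.pop(0)
    return child

def mutate(order, rate=0.2):
    # Occasionally swap two positions.
    order = order[:]
    if random.random() < rate:
        i, j = random.sample(range(len(order)), 2)
        order[i], order[j] = order[j], order[i]
    return order

def run_ga(pop_size=30, generations=100):
    random.seed(0)
    pop = [random.sample(range(len(CITIES)), len(CITIES)) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=tour_length)
        elite = pop[: pop_size // 2]           # truncation selection (elitist)
        children = [
            mutate(order_crossover(*random.sample(elite, 2)))
            for _ in range(pop_size - len(elite))
        ]
        pop = elite + children
    return min(pop, key=tour_length)

best = run_ga()
print(tour_length(best))
```

The chromosome here is a permutation, so the crossover has to be permutation-preserving - a plain one-point crossover would produce invalid tours.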

69

Hostilis_ t1_jak681p wrote

>But you can't always use gradient descent. Backprop requires access to the inner workings of the function

Backprop and gradient descent are not the same thing. When you don't have access to the inner workings of the function, you can still use stochastic approximation methods for getting gradient estimates, e.g. SPSA. In fact, there are close ties between genetic algorithms and stochastic gradient estimation.
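
To make the SPSA idea concrete, here's a hedged sketch (all constants are illustrative): two black-box function evaluations with a random ±1 perturbation give a stochastic gradient estimate in any dimension, which can then drive ordinary gradient descent.

```python
import numpy as np

def spsa_gradient(f, x, c=1e-2, rng=None):
    # One SPSA estimate: perturb all coordinates at once with a
    # Rademacher vector, then use a two-point finite difference.
    rng = rng or np.random.default_rng(0)
    delta = rng.choice([-1.0, 1.0], size=x.shape)
    return (f(x + c * delta) - f(x - c * delta)) / (2 * c) * (1.0 / delta)

# Minimise a simple quadratic using only function evaluations.
f = lambda x: float(np.sum((x - 3.0) ** 2))
x = np.zeros(4)
rng = np.random.default_rng(0)
for _ in range(500):
    x -= 0.05 * spsa_gradient(f, x, rng=rng)
print(f(x))  # close to 0
```

Note the cost is two evaluations per step regardless of dimension, versus 2n for coordinate-wise finite differences.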

33

AdFew4357 t1_jak9487 wrote

Is this a type of mathematical optimization?

−2

topcodemangler t1_jakalh1 wrote

An always interesting and alluring idea for me was to use a GA to search for a combination of elementary information-processing elements (probably Boolean gates) and memory that would result in some novel ML architecture - maybe one much more effective than NNs, as it would be possible to implement it directly in electronics without the overhead.

5

M_Alani t1_jakapj2 wrote

Oh, this brings back a lot of memories. I remember using them in the early 2000s to optimize neural networks, back when Matlab was the only option and we couldn't afford it, so we had to build NNs from scratch... using Visual Basic 😢

Back to your question, I don't think they're dead. Probably their use in NN is. Edit:spelling

37

Kitchen_Tower2800 t1_jakjrxr wrote

I've never directly worked with either, but aren't RL agent-competition approaches (i.e. simulating games between agents with different parameter values and iterating on those agents) a form of genetic algorithm?

It's also worth noting that this is exactly the type of problem that genetic algorithms were made for: no gradients, highly multimodal.

8

bbateman2011 t1_jakl8m8 wrote

I use GA optimization for non-convex problems, mainly hyperparameter optimization. Sometimes it’s very effective but I’ve not found a way to know ahead of time if it will outperform other algorithms.

5

csinva t1_jakmagd wrote

I think genetic algorithms may have a new role to play in problems involving inference / text generation / prompting with language models, even if they aren't used to train the models themselves.

For example, in our recent work on natural-language prompting, we use a genetic algorithm to generate prompts that are semantically coherent -- the genetic algorithm lets us make use of suggestions by a language model, for which gradients would be hard to obtain.

6

Deep_Sync t1_jakn00h wrote

One of my friend use genetic algorithms to do acoustic material research.

2

Red-Portal t1_jakr3yf wrote

The fundamental problem with evolutionary strategies is that they are a freakin nightmare to evaluate. It's basically impossible to reason about their mathematical properties, experiments are noisy as hell, and how representative are the benchmark objective functions anyway? It's just really hard to do good science with those, which means it's hard to make concrete improvements. Sure, once upon a time they were the only choice for noisy, gradient-free global optimization problems. But now we have Bayesian optimization.

2

discord-ian t1_jakt2gl wrote

I still see papers written on them occasionally. I have always wanted to implement one, but I've never had a use case. I think there are certain categories of problems where they excel, but in the real world, most of the time, there seems to be a better approach.

One real-world use case I saw was using genetic algorithms to design an automobile brake rotor to reduce heat (or increase heat dissipation). From what I remember of the presentation... Basically, they had a very large number of mathematically definable designs with many input variables. The interactions between these different variables were not necessarily clear. Elements of one of these designs might combine well with elements from a totally separate design. And the model to test them was computationally expensive.

They were able to use this genetic algorithm to design a rotor that, at least on the computer, was meaningfully better than their company's (and likely the industry's) state of the art.

2

ab3rratic t1_jakzbv6 wrote

GAs are not great for expensive-to-evaluate functions. And those have become kind of relevant lately.

3

WikiSummarizerBot t1_jal4gkz wrote

Evolved antenna

>In radio communications, an evolved antenna is an antenna designed fully or substantially by an automatic computer design program that uses an evolutionary algorithm that mimics Darwinian evolution. This procedure has been used in recent years to design a few antennas for mission-critical applications involving stringent, conflicting, or unusual design requirements, such as unusual radiation patterns, for which none of the many existing antenna types are adequate.


1

Dendriform1491 t1_jalb2vb wrote

Genetic algorithms require you to create a population where the genetic operators are applied (mutation, crossover and selection).

Creating a population of neural networks implies having multiple slightly different copies of the neural network to be optimized (i.e.: the population).

This can be more computationally expensive than other techniques which will do all the learning "in-place".
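
A toy sketch of that cost (illustrative only - the model, data, and hyperparameters are made up): even for a tiny linear model, a GA-style loop keeps dozens of perturbed copies of the parameters alive at once, where gradient descent would update a single copy in place.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(64, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w

def fitness(w):
    # Negative mean squared error: higher is better.
    return -float(np.mean((X @ w - y) ** 2))

# The "population": 40 slightly different copies of the model's weights.
pop = [rng.normal(size=3) for _ in range(40)]
for _ in range(200):
    pop.sort(key=fitness, reverse=True)
    parents = pop[:10]                              # truncation selection
    pop = parents + [p + 0.05 * rng.normal(size=3)  # mutated offspring
                     for p in parents for _ in range(3)]
best = max(pop, key=fitness)
```

Every generation evaluates 40 models where SGD would evaluate one, which is exactly the overhead being described - though the 40 evaluations are embarrassingly parallel.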

2

th1nk2much t1_jalc88z wrote

I recently used a genetic algorithm in a supply chain application. Not the fastest algo but we made it work for our purpose

2

visarga t1_jalh1r1 wrote

You don't always need a population of neural networks; it could be a population of prompts or even a population of problem solutions.

If you're using GAs to solve specific coding problems, there is one paper where they use an LLM to generate diffs for code. The LLM was the mutation operator, and they even fine-tune it iteratively.

3

FinancialElephant t1_jaliqsh wrote

Genetic optimization might be dead in most cases. I think a lot of the ideas aside from optimization algorithms are still relevant.

I've found GP techniques can yield parsimonious models. A lot of the big research these days is on big models, but GP seems good for small, parsimonious, and elegant models. It's good for low-data regimes, specialized problems, and problems where you have expert knowledge you can encode. Generally speaking, I like working with GP because you end up with a parsimonious and interpretable model (the opposite of a lot of NN research).

In practice, I've found importance-sampling methods to work about as well as genetic optimization for optimizing GP trees/grammars, for the small amount of work I did with them. I haven't found either method to edge out the other by much, but it could depend on the problem.

I don't know if this is considered GP (or GA) without a genetic optimization method. However, I think we can say that the notion of optimizing a symbolic tree or grammar was heavily developed within GP, even if today you might use some Monte Carlo optimization method in practice.

3

sobe86 t1_jalldpg wrote

Also, there needs to be a learnable, nontrivial 'strategy' to take advantage of, otherwise it's not going to beat simulated annealing except on speed. The couple of times I've used it in practice, SA was about as good as we could get performance-wise.
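
For reference, simulated annealing itself is only a few lines. This is a generic sketch (not the commenter's setup - the objective and cooling schedule are invented for illustration) with a random-neighbour move and the Metropolis acceptance rule under geometric cooling.

```python
import math
import random

def simulated_annealing(cost, neighbour, x0, t0=1.0, cooling=0.995, steps=2000):
    # Accept downhill moves always; uphill moves with probability
    # exp(-delta / t), where t decays geometrically.
    random.seed(0)
    x, best = x0, x0
    t = t0
    for _ in range(steps):
        cand = neighbour(x)
        delta = cost(cand) - cost(x)
        if delta < 0 or random.random() < math.exp(-delta / t):
            x = cand
            if cost(x) < cost(best):
                best = x
        t *= cooling
    return best

# Toy use: minimise a bumpy 1-D function over the integers.
cost = lambda x: (x - 17) ** 2 + 10 * math.sin(x)
neighbour = lambda x: x + random.choice([-1, 1])
result = simulated_annealing(cost, neighbour, 0)
print(result)
```

No population, crossover, or mutation operators to design - which is part of why SA is often the first thing to try when gradients are unavailable.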

17

serge_cell t1_jalnarf wrote

The notable difference between GAs and other random searches is the crossover operator and, in its theory, the "building blocks" hypothesis. Neither was confirmed during years (dozens of years) of attempted use of GAs.

3

PassionatePossum t1_jalnb1q wrote

I think, my professor summarized it very well: "Genetic algorithms is what you do when everything else fails."

What he meant is that they are very inefficient optimizers. You need to evaluate lots and lots of configurations because you are stepping around more or less blindly in the parameter space, relying only on luck and a few heuristics to improve your fitness. But their advantage is that they will always work, as long as you can define some sort of fitness function.

If you can get a gradient, you are immediately more efficient because you already know in which direction you need to step to get a better solution.

But of course there is room for all algorithms. Even when you can do gradient descent, there are problems where it quickly gets stuck in a local optimum. There are approaches for "restarting" the algorithm to find a better local optimum. I'm not that familiar with that kind of optimization, but it is not inconceivable that genetic algorithms have a role to play in such a scenario.
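
A minimal sketch of the "works as long as you can define a fitness function" point (a toy example with invented parameters): the GA below maximises the number of 1-bits in a string and touches nothing but a fitness function - no gradients, just selection, crossover, and mutation.

```python
import random

def one_max(bits):
    # The only thing the GA ever sees: a scalar fitness.
    return sum(bits)

def evolve(n_bits=32, pop_size=40, generations=60, p_mut=0.02):
    random.seed(0)
    pop = [[random.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=one_max, reverse=True)
        parents = pop[: pop_size // 2]       # keep the fitter half (elitist)
        children = []
        while len(children) < pop_size - len(parents):
            a, b = random.sample(parents, 2)
            cut = random.randrange(1, n_bits)                 # one-point crossover
            child = a[:cut] + b[cut:]
            child = [bit ^ (random.random() < p_mut) for bit in child]  # bit-flip mutation
            children.append(child)
        pop = parents + children
    return max(pop, key=one_max)

best = evolve()
print(one_max(best))
```

With these settings it typically converges to the all-ones optimum, but notice how many fitness evaluations it spends (pop_size × generations) on a problem a gradient method would treat as trivial.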

24

drplan t1_jalud65 wrote

Genetic algorithms are still useful for strange objective functions that defy analytical approaches, such as anything based on complex simulations. But it somehow has always been this way.

Nowadays things have been changed by generative models for code generation. A few years ago, Genetic Programming (and its many variants) was the only approach to this; now some problems can just be solved by asking a language model to write the code for xyz.

2

M_Alani t1_jam3i7i wrote

It wasn't as bad as it sounds. The fun part was that you had to understand how every little piece of the algorithm works; the nightmare was implementing all of this with 512 MB of RAM. We didn't have the luxury of trying different solutions.

9

BigBayesian t1_jam6z7u wrote

Genetic algorithms are good, as you said, when you really understand the space and can come up with a really good candidate generation system. They’re okayish (or, the same as everything else) when you have no understanding of the space at all, and you’re just totally guessing. They can’t latch onto a curve in design space as well as things that look at a simpler gradient can. So maybe they’re best used for really complex spaces where gradient based methods don’t do well. The kind of places you’d use Gibbs sampling, or general optimization algorithms.

So, basically, they’re useful when you have good feature engineering already done, like many methods that have fallen out of vogue in the age of letting algorithms and data do your feature engineering for you. And they’re as good a shot in the dark as any when standard methods fail and you’ve got no clue how to proceed.

So, yeah, the number of times genetic algorithms are the “right” choice is pretty limited these days.

3

TobusFire OP t1_jamcrd2 wrote

This is a reasonable question but I believe you are misunderstanding. The randomization of parameters in a neural network (I assume you are talking about initialization?) is certainly not the same as a mutation in a GA. Mutation occurs randomly, sure, but is selected for and crossed over, whereas hill-climbing and gradient descent simply move on the gradient and do not use either random mutations or cross-over so are not genetic.

1

IanisVasilev t1_jamldhj wrote

It turned out that other models have superior genes.

1

risoo7 t1_jamu5h2 wrote

In one of our recent works (https://arxiv.org/abs/2012.14956), we used a genetic algorithm to attack NLP models in a hard-label black-box setting, where we do not have access to the confidence scores of the model.

1

Readityesterday2 t1_jamx6h5 wrote

You can say they have gone extinct in an ecosystem of competition from superior approaches 😂

1

nfmcclure t1_jamy7yz wrote

I don't think they are dead. Their popularity for NNs is much lower for sure.

In general, GAs can theoretically solve any problem (if you can formulate a fitness function), given long enough time. Because of that, I think they will always have some use cases.

2

-EmpiricalEvidence- t1_jan0mqg wrote

Exactly because of the computational demands, I don't think genetic algorithms have ever really been "alive", but with compute getting cheaper I could see them finding success similar to the rise of deep learning.

Evolution Strategies as stabilizers without the genetic component are already being deployed quite well e.g. AlphaStar.

Jeff Clune was quite active in that area of research and he recently joined DeepMind.

https://twitter.com/jeffclune/status/1629132544255070209

1

HackZisBotez t1_jan2bme wrote

Sorry to be that person, but that's not Nature, that's Scientific Reports, which is a tier 3 journal in the Nature portfolio. If Nature would be compared to a top conference, Scientific reports would be less than a workshop paper.

5

ahf95 t1_jan46jv wrote

They are certainly still used for RL (and other cases where you don't have a gradient), but even in those contexts there have been modern advancements that cause the preferred algorithms to diverge from old-school genetic algorithms. For instance, things like Particle Swarm Optimization and the Cross-Entropy Method have their conceptual origins in sampling regimes similar to MCMC approaches, but they've become their own entities at this point, outperforming genetic algorithms and really being unique and broad enough to get their own categories.

1

extracensorypower t1_jan4yel wrote

I think they're still useful for "no information at all" scenarios where attempting a solution is just too time consuming or not possible using other methods (e.g. traveling salesman problem).

As a practical matter, I think they're best integrated with other methods as "first cut" solutions that get you closer to something you can work out with a neural net or rule based system.

That said, I'm unaware of any NN or rule-based solution better than a GA for solving the traveling salesman problem even now. So maybe some NP-hard problems will always be best attacked with GAs.

3

marcus_hk t1_jan8rmh wrote

They might see a resurgence in dynamic multi-agent environments.

1

scawsome t1_janbtva wrote

Not necessarily. Bayesian methods work great when you have expensive objective function evaluations that can only be evaluated in serial (or limited parallel evaluations). Bayesian methods aren't ideal in massively parallelizable evaluations (evaluating >100 points at a time) or when evaluations are relatively cheap. It depends on the cost of optimizing the acquisition function. I've actually played around with combining BO with evolutionary algorithms to extend BO towards massively parallelizable evaluations and have seen some promising results.

4

dragosconst t1_janbuui wrote

I think there are few problems where a couple of extra assumptions - ones that would make much more efficient methods work (not necessarily NNs, of course) - don't hold. I'm not sure there exist problems where genetic algos outperform other methods, disregarding problems where only genetic algos work.

1

proton-man t1_janca53 wrote

It was. Dumb too. Because of the limitations of memory and computing power at the time you had to constantly tweak parameters to optimize learning speed, avoid overfitting, avoid local optimums, etc. Only to find that the best performing model was the one generated by your 2 AM code with the fundamental flaw and the random parameters you chose while high.

3

noeda t1_janf9cr wrote

I use CMA-ES (a type of evolutionary algorithm) for training neural networks for finance stuff. The neural networks involved are not superhuge so it works out (IIRC the number of parameters is around ~500-1000).

The fitness function is pretty complicated and written in Rust and I put a lot of effort to making it fast because these algorithms need to evaluate it many many times. I feel using evolutionary algorithms makes coding simpler because you do not need to care that whatever you are writing is differentiable or that some backprop/gradient descent library needs to be able to "see" inside your function.

I do think my use case is a bit more niche. I live in hope that some breakthrough happens that makes evolutionary algorithms practically usable for large neural networks.

3

ajt9000 t1_janfj0c wrote

Who says genetic algorithms are dead? They're pretty much dead for training neural nets absolutely, but there are tons of other more general optimization problems that GAs (or more generally evolutionary algorithms) are well suited for.

Not to mention they still have plenty of utility as a search algorithm for hyperparameters so they aren't even dead for neural applications.

3

ajt9000 t1_janfx47 wrote

This comment makes me wonder if the same rules about using one-hot encoding instead of ordinal encoding for classifiers still apply to a neural net trained with a gradient-free search algorithm like a GA instead of backprop.

1

lmericle t1_jannla8 wrote

The trick with genetic algorithms is you have to tune your approach very specifically to the kinds of things you're modelling. Different animals mate and evolve differently, in the analogical view.

It's not enough to just do the textbook "1D chromosome" approach. You have to design your "chromosome", as well as your "crossover" and "mutation" operators specifically to your problem. In my experience, the crossover implementation is the most important one to focus on.
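
A small illustration of why the operator design matters (a hypothetical example, not the commenter's code): for permutation chromosomes, the textbook one-point crossover produces invalid children, while a problem-aware operator keeps every child a valid permutation.

```python
def naive_crossover(p1, p2, cut):
    # Textbook one-point crossover: blind to problem structure,
    # so it may duplicate and drop genes.
    return p1[:cut] + p2[cut:]

def permutation_crossover(p1, p2, cut):
    # Problem-aware operator: keep a prefix of parent 1, then fill
    # the remaining genes in parent 2's relative order.
    head = p1[:cut]
    tail = [g for g in p2 if g not in head]
    return head + tail

p1 = [0, 1, 2, 3, 4]
p2 = [4, 3, 2, 1, 0]

bad = naive_crossover(p1, p2, 2)
good = permutation_crossover(p1, p2, 2)
print(bad)   # [0, 1, 2, 1, 0] -- genes 0 and 1 repeated, 3 and 4 lost
print(good)  # [0, 1, 4, 3, 2] -- still a valid permutation
```

The same reasoning applies to mutation: a swap of two positions preserves validity where a random gene overwrite would not.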

2

TobusFire OP t1_janxqya wrote

My thoughts too. Simulated annealing and similar strategies intuitively seem better in most cases where traditional gradient methods aren't applicable. I can imagine a handful of cases where genetic algorithms MIGHT be better, but even then I am not fully convinced and it just feels gimmicky.

1

TobusFire OP t1_janyxnp wrote

> isn't RL agent-competitions approaches (i.e. simulating games between agents with different parameter values and iterating on this agents) a form of genetic algorithms?

Hmm, I hadn't thought about RL like that. I guess the signal from a reward function based on competition could be considered "fitness", and then perhaps some form of cross-over is done in the way we iterate on and update the agents. Interesting thought.

1

TobusFire OP t1_janz44w wrote

Absolutely, lots of great work is being done in this domain right as we speak. Neuromorphic computing and analog computing I personally think are some of the most exciting things to look out for in the next 10 or so years.

2

TobusFire OP t1_janzia9 wrote

Agreed. That being said, I think the prior is that you still need to have enough understanding of the state space to be able to design good mutations, cross-over, and fitness. This can easily add a lot of overhead. In contrast, I think that other cool methods like swarm optimization and ant colony optimization are also promising and in some ways simpler.

1

TobusFire OP t1_jao0dm8 wrote

Interesting, thanks for sharing your thoughts! I'm a bit curious about why genetic algorithms might be better for these strange objective functions, as compared to something like simulated annealing. I can understand that a pure gradient method could easily be insufficient, but do the underlying components of genetic algorithms (like cross-over, etc.) really provide a distinct advantage here? Especially when the fitness is probably directly related to the gradient anyways

1

TenaciousDwight t1_jao0jq8 wrote

I think GA is still fairly popular in operations management to e.g. produce solutions for delivery scheduling problems.

1

rflight79 t1_jao1dn7 wrote

Our lab made a python package that combines simulated annealing and genetic algorithms for helping to solve really gnarly inverse problems in chemistry modeling. The package is on GH, and here is the link to the paper where it was primarily used.

3

avialex t1_jap04wq wrote

I was kinda excited; I had hoped to find an evolutionary algorithm for searching a latent space. I've been having a hell of a time trying to optimize text encodings for diffusion models.

1

Ularsing t1_japrrrj wrote

No way. They're still one of the best bets out there for high-dimensional discrete optimization.

1

sea-shunned t1_jaqosa2 wrote

In my experience, if you are "stepping around more or less blindly" then the problem & EA have not been properly formulated. In general of course, if a gradient is available then gradient descent will do a better job >99% of the time.

Though with a bit of domain knowledge, and/or some careful design of the objective function(s), variation operators etc., an EA can be a pretty efficient explorer of the space. It's nichely-applicable, but when done properly it's far from blind.

1

drplan t1_jav01pf wrote

I think the best approach to this is thinking about the search space and the fitness landscape. If different components of the solution vector can independently improve the fitness, crossover operators will have a positive impact.

Another aspect is the search space itself. Is it real-valued, is it binary, is it a tree-like structure,..?

Traditionally, genetic algorithms operate on binary encodings, and they often work OK on problems which have binary solutions (a fixed-size vector of bits). These problems do not have a gradient to start with. However, one should investigate beforehand whether there are combinatorial approaches to solve the problem.

For real-valued problems with no gradient: evolution strategies with a smart mutation operation like CMA (covariance matrix adaption) would be a good choice.
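
As a hedged sketch of that last suggestion (a toy version with invented constants): a (1+1) evolution strategy with a 1/5-success-rule style step-size adaptation. Full CMA-ES additionally adapts a covariance matrix for the mutation, which this simplified version omits.

```python
import numpy as np

def es_minimise(f, x0, sigma=1.0, iters=600, seed=0):
    # (1+1)-ES: one parent, one Gaussian-mutated offspring per step.
    # Step size grows on success and shrinks on failure, roughly
    # targeting the classic ~1/5 success rate.
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    fx = f(x)
    for _ in range(iters):
        cand = x + sigma * rng.normal(size=x.shape)  # Gaussian mutation
        fc = f(cand)
        if fc < fx:
            x, fx = cand, fc
            sigma *= 1.1    # success: expand the search radius
        else:
            sigma *= 0.98   # failure: contract it
    return x, fx

# Toy use: a real-valued problem with no gradient exposed.
f = lambda x: float(np.sum((np.asarray(x) - 2.0) ** 2))
x, fx = es_minimise(f, np.zeros(5))
print(fx)
```

The step-size adaptation is what separates this from naive random search; CMA replaces the single scalar sigma with a full covariance matrix so the mutation distribution can stretch along promising directions.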

1

ACH-S t1_jawnajq wrote

I'm not sure whether you mean genetic algorithms or evolutionary algorithms, or if those terms are interchangeable for you (often, they are not). Anyway, a field that heavily relies on them is Quality-Diversity (https://quality-diversity.github.io/, there is a nice list of papers there). Also, I would recommend having a look at the proceedings of the GECCO conference (e.g. https://dl.acm.org/doi/proceedings/10.1145/3512290; the conference is much smaller than NeurIPS/ICML/etc., and the research quality tends to be a bit more variable, but you'll see that evolutionary algorithms, and in particular genetic ones, are far from dead).

The idea that "designing an experiment for a genetic algorithm requires sufficient prior" doesn't sound correct to me; generally you turn to these methods when you don't have any reliable priors on the search space (as other comments have pointed out, see CMA-ES as an example; I'll add ES https://arxiv.org/abs/1703.03864 as another useful example that I've personally often used to simplify meta-learning problems).

1

_TheHalfTruth_ t1_jbdaf27 wrote

Metaheuristic algorithms like GA and simulated annealing are almost identical to Bayesian methods/MCMC. Metaheuristic algorithms are Bayesian methods if you can pretend that your objective function is proportional to a probability distribution that you want to maximize. They just take unique approaches to exploring the posterior distribution. But conceptually they’re identical
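
A sketch of that correspondence (a toy example with made-up parameters): the Metropolis acceptance rule below targets the density p(x) ∝ exp(f(x)/T), i.e. the objective plays the role of an unnormalised log-density. At a fixed temperature the chain samples around the maximiser of f; shrinking T (which is what simulated annealing does over time) concentrates the samples on the maximiser itself.

```python
import math
import random

def metropolis(f, x0, temp, steps=5000, seed=0):
    # Random-walk Metropolis targeting p(x) proportional to exp(f(x)/temp).
    random.seed(seed)
    x = x0
    samples = []
    for _ in range(steps):
        cand = x + random.gauss(0, 0.5)
        delta = f(cand) - f(x)
        # Accept with probability min(1, exp(delta / temp)).
        if delta >= 0 or random.random() < math.exp(delta / temp):
            x = cand
        samples.append(x)
    return samples

f = lambda x: -(x - 1.0) ** 2          # objective == log-density up to 1/temp
warm = metropolis(f, 0.0, temp=1.0)    # broad samples around the optimum x = 1
cold = metropolis(f, 0.0, temp=0.01)   # tightly concentrated near x = 1
```

Swap the fixed temperature for a decreasing schedule and this is exactly simulated annealing; keep it fixed and it is MCMC over the induced posterior - which is the conceptual identity being described.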

1