ole72444 OP t1_iwa5adf wrote on November 14, 2022 at 2:42 AM

Yes, I understand it looks like a data preprocessing problem (and it actually is). But this is a toy example to test whether NNs can actually generalise functions of this sophistication.

FuB4R32 t1_iw8kirz wrote on November 13, 2022 at 7:56 PM

You could also do this at the input if it's hard to edit the training data, e.g. in TensorFlow: https://www.tensorflow.org/api_docs/python/tf/gather

https://www.tensorflow.org/api_docs/python/tf/math/argmax

Generally, you should look into custom operations like these to achieve what you want.
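To make the gather/argmax idea concrete, here is a minimal sketch in NumPy rather than TensorFlow. It assumes np.take and np.argmax as stand-ins for the linked tf.gather and tf.math.argmax, and the batch values and the choice of columns 3 and 7 are made up for illustration; the thread does not show the OP's actual data.

```python
import numpy as np

# Hypothetical batch: 2 samples with 8 features each (not the OP's data).
batch = np.array([[4, 0, 7, 1, 3, 9, 2, 5],
                  [6, 8, 1, 9, 2, 2, 7, 3]])

# Keep only every fourth feature (columns 3 and 7).
# np.take along an axis plays the role tf.gather would play in TensorFlow.
cols = np.arange(3, batch.shape[1], 4)
selected = np.take(batch, cols, axis=1)

# Position of the largest kept value in each sample.
# np.argmax plays the role of tf.math.argmax.
winner = np.argmax(selected, axis=1)

print(selected.tolist())
print(winner.tolist())
```

Doing the selection at the input like this leaves the stored training data untouched; only the view the model sees changes.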

ContributionWild5778 t1_iw95dyj wrote on November 13, 2022 at 10:10 PM

Agreed. It's a data pre-processing step; making a model do this would be a very complicated task.

ole72444 OP t1_iwa5ljr wrote on November 14, 2022 at 2:45 AM

I'm trying to see if NNs can actually generalise such functions. I'm using the preprocessing that you've recommended to create the ground truth labels.

HMasterSunday t1_iw9allv wrote on November 13, 2022 at 10:47 PM

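Also: numpy.split can cut a NumPy array into several pieces, so it simplifies to:

    import numpy as np

    def process_data(input_array):
        # Cut the array into len(input_array) // 4 equal, contiguous chunks,
        # i.e. chunks of length 4 (the length must be divisible by 4).
        cut_array = np.split(input_array, len(input_array) // 4)
        max_array = []
        for cut in cut_array:
            # Record the largest value in each chunk.
            max_array.append(max(cut))
        return max_array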

Much shorter. I used this method recently, so it's at the front of my mind.

edit: don't know how to format on here, sorry

sckuzzle t1_iw9fe1i wrote on November 13, 2022 at 11:21 PM

Writing "short" code isn't always a good thing. Yes, your suggestion has fewer lines, but:

It takes ~6 times as long to run
It does not return the correct output (split does not take every nth value, but rather groups it into n groups)

I'm absolutely not claiming my code was optimized, but it did clearly show the steps required to calculate the necessary output, so it was easy to understand. "Short" code makes it much harder to see what is happening, and often leads to bugs (as seen here). Also, depending on how you do it, it often takes longer to run (the way it was implemented requires extra steps which aren't necessary).

HMasterSunday t1_iw9qr8l wrote on November 14, 2022 at 12:46 AM

Interesting, I didn't try a test run to time both approaches; I'll do that more often. As for your other point, though, my code does account for that already: the number of individual cuts is 1/4 of the length of the full array (len(input_array)/4), so it splits it up into arrays of length 4 anyway. That much I do know at least.

sckuzzle t1_iwaia67 wrote on November 14, 2022 at 4:34 AM

> As per your other point though, my code does account for that already

You might try running it. It returns [3.0, 8.0, 12.0, 8.0]. The intended output is [False, False, True, False]. OP didn't ask for it to be split into groups of four; they asked for every fourth value to be taken.
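The difference between the two readings can be sketched as follows. The 16 values are made up (the original input is not shown in this part of the thread), and starting the stride at index 3 is an assumption about what "every fourth value" means.

```python
import numpy as np

# Hypothetical 16-value input; not the data from the original post.
values = np.array([5, 1, 9, 2, 7, 3, 0, 8, 4, 6, 2, 1, 3, 9, 5, 0])

# np.split with an integer cuts the array into that many equal,
# contiguous chunks: here 4 chunks of 4 neighbouring values each.
chunks = np.split(values, len(values) // 4)
group_max = [int(chunk.max()) for chunk in chunks]

# Strided slicing instead picks every fourth element
# (indices 3, 7, 11, 15 under the assumed starting offset).
every_fourth = values[3::4].tolist()

print(group_max)     # maxima of contiguous groups
print(every_fourth)  # every fourth value
```

With this input the two results are [9, 8, 6, 9] and [2, 8, 1, 0], so grouping and striding are clearly different operations even though both involve the number four.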

Making a model predict on the basis of a particular value

Comments

sckuzzle t1_iw8jv72 wrote on November 13, 2022 at 7:52 PM