Comments

You must log in or register to comment.

SnthesisInc OP t1_jcy937l wrote

Data from r/datascience poll.

Infographic made with Adobe Illustrator.

−7

Omaha4Loot t1_jcy9n21 wrote

Finding data that is applicable to the premise at hand and deciding whether has value is always the biggest question for me

1

DAFPPB t1_jcy9trh wrote

Iā€™m being a bit nit picky but either colouring the text to match up with the pie or creating some numeric legend would improve the readability.

It matches up with the lifecycle during a sprint. šŸ˜‚

27

Hsinats t1_jcyaf3t wrote

This is a tough graph to read easily; the colour is completely redundant. You might as well have just shown the table at the bottom because it shows all the information without having to cross check with anything else.

16

DutchVortex t1_jcybdk3 wrote

Cleaning was nearly 80% of my work for some projects...

3

FluffzMcPirate t1_jcyd23d wrote

I don't like this graph. Why add color if you're not gonna use it. You could leave out the piechart from this picture and it would be easier to understand.

10

hopingforabetterpast t1_jcyfn5w wrote

17% color coding the pie but not the labels

12% not ordering the slices by size nor chronologically

37% no sample size and no sources

53% percentages in this comment not adding up to 100%

23

moglito t1_jcypn36 wrote

A legend that uses values as keys? That's so silly. What if two of the values were the same?

2

Psidom t1_jcyu18z wrote

People don't have to deal with data size and scalability issue? For me, I found it quite a pain having to make sure data fits the machine for in memory analytics. And handle OOM even when using parallel engine like Apache Spark.

1

Rear-gunner t1_jedeonr wrote

The results of the poll are very much my experience.

1