Hey Reddit,

My friend and I are building a project management platform for AI/data science teams (essentially a JIRA for ML). We aim to develop a data-centric, experimental tool that models the ML pipeline to organize workflows, building off the Agile methodology of software development. Our tool will allow ML engineers to design, track, and manage custom pipelines, data flows, and models all on the cloud. Below of a list of some features we plan to introduce:

Integrations: Include a host of integrations to MLOps tools (KubeFlow, MLFlow, etc), cloud computing services (AWS, Google Cloud, Azure), source code management (Github, Bitbucket)

Iterations: Allow multiple iterations within pipelines, and separate each iteration by various steps in the ML pipeline (business understanding, data visualization, data pre-processing, model training, model testing, model optimization, and deployment). Include a Kanban chart per each part of the pipeline

Callbacks: The ability to request to go back to previous stages of the AI pipeline to either improve previous steps (like data preprocessing or model training/development/designing) or request other teams to improve previous steps (we refer to this as callbacks)

Storage: A cloud storage solution to store ML models, datasets, or any other metrics/graphs/whatever ML engineers want to store.

Sketchpad: A sketchpad to design data flows and ML models, and link them to code Private Assignment: The ability to individually/uniquely assign tasks to different roles in a team, and the ability to be able to privately and specifically send vital information to specific people. for example, the pm could only send the data set to the data engineer, the preprocessed data to an ML engineer (potentially added on top of all this is a differential privacy layer), and send the packaged model to an integration engineer.

Chat: A chat/communication platform to interact w/ your team Quantitative Focus: ML is quantitative. The client wants QUANTITATIVE results. Hence, the epic should be emphasized on being quantitative rather than qualitative.

Experiments: We redefine “sprints” as “experiments.” We make two changes to sprints. First, we DO NOT have any deadlines on any sprints. This is to not put the engineer under pressure. Secondly, instead of asking “what”, we ask “how” when asked to describe the experiment. This provides a heavily qualitative focus on the experiments, with a focus on function rather than immediate deliverability as in software engineering.

We would appreciate any feedback on our platform, as well as any problems you guys are facing in data science/ML project management.

Thanks a bunch in advance!

Comments

bmrheijligers t1_j00xz4x wrote on December 13, 2022 at 7:34 AM

#918,478

Head of data science here. Noble attempt. For a real world use case, reach out to me. Our daily workflow is a long way removed from the idealized image you sketch here.

seiqooq t1_j012of7 wrote on December 13, 2022 at 8:37 AM

#918,624

Replying to bmrheijligers (#918,478)

Curious as well if you don’t mind.

zenpianist t1_j01jvp7 wrote on December 13, 2022 at 12:22 PM

#919,234

I think real world ML Dev and OPS is much messier. Something as simple as a decoupled Inference pipeline would mean a lot to us, instead of having to retrigger the workflow when something failed. At TB scale, even snapshotting outputs from each stage became ridiculously expensive and downright impossible. Would love to see how you address those

testuser514 t1_j028v85 wrote on December 13, 2022 at 3:43 PM

#920,313

Replying to bmrheijligers (#918,478)

Would you be willing to expand here ?

Professional-Luck-64 t1_j03y9ya wrote on December 13, 2022 at 10:11 PM

#923,190

I’d be interested to learn and contribute what I can

bmrheijligers t1_j0czrw1 wrote on December 15, 2022 at 7:21 PM

#938,590

Replying to seiqooq (#918,624)

You are welcome! The main problem I face is that I have high volume, high volatility production requests for Statistical analysis come in. Support and change requests on top of that. And a few slow moving innovation projects where we are design better tools for ourself to handle the workload.

This means I need non agile production planning, resource management, ticketing, request management, result and report management,code and data management, shared knowledge base applied to the same group 15 people simultaneously ;) We 30-60 requests a week, 30% with daily moving deadlines and 80% of the work comes after first delivery and everything is handled over email.

We do our planning in excel 😂

Welcome to the Jungle 😎