Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks

MSaiRam10 · 2026-05-02T14:31:38.000Z 1777732298

Notebooks as the output format is funny because notebooks are famously bad for reproducibility. Out of order execution, hidden state, etc. You're solving "chat isn't reproducible" with a format that also isn't really

pplonski86 · 2026-05-04T07:51:56.000Z 1777881116

Python notebooks are not reproducible when used by humans. When notebook format is used to store conversations for AI data analysis, it preserves the chat history and is ideal for reproducibility.

trymamboapp · 2026-05-04T13:15:55.000Z 1777900555

"AI saves analysis as notebooks" is fighting the wrong fight ig. The reproducibility issue with notebooks isn't the format. it's out-of-order cell execution and silent kernel state

llm generation makes that worse: the model has no memory of what state existed when it wrote cell 7, and neither does the user.

pplonski86 · 2026-05-04T14:14:42.000Z 1777904082

User is not touching notebook at all, user just ask questions in natural language, and AI is using Python to compute answer, the ipynb notebook format is used to save the conversation.

hasyimibhar · 2026-05-02T14:48:45.000Z 1777733325

How does this compare to open source Deepnote[0]? We use the cloud version (BYOC) at my previous company to replace self-hosted Jupyter notebooks, and it's pretty great.

[0] https://github.com/deepnote/deepnote

pplonski86 · 2026-05-04T08:03:12.000Z 1777881792

The goal of MLJAR Studio is to make it easy to analyze data for people with large domain knowledge but lack of programming skills. We do not focus on notebooks. Python notebook for us is compute and store layer. Our main interface is chat with AI data analyst. The conversation can be opened as classic notebook, but the main UI is simple chat.

hasyimibhar · 2026-05-05T02:32:08.000Z 1777948328

You should check them out, their interface pretty much looks like chat nowadays.

pplonski86 · 2026-05-05T07:14:42.000Z 1777965282

Thank you! I will check them out. It is worth to mention that MLJAR Studio is a desktop application, which is easy to install. It is running locally, and support local LLMs so all data stay safe.

2ndorderthought · 2026-05-02T10:40:07.000Z 1777718407

This is one of those product areas I would call high-risk without a human in the loop. So I am glad you kept a person in the loop. It's really easy to lose tons of money making decisions based on bad statistics or models. Anyone remember how much money zillow lost because of automatic time series models?

I do have concerns about the workflow. Data people aren't usually the best programmers. Models hallucinate and make mistakes sometimes subtle sometimes not. Can you think of a way to prevent data scientists from having to be expert code reviewers? I feel like taking away the code gives them the chance to find and fix mistakes in their reasoning but I have no evidence for that.

pplonski86 · 2026-05-04T08:28:17.000Z 1777883297

Human in the loop in data analysis is really challenging task. We provide Python code for inspection, so user can check details how results were produced. Additionally, we run AI on results - user need to check the outputs and AI provided insights.

amirathi · 2026-05-02T12:17:16.000Z 1777724236

Really cool. If somebody doesn't want to adopt a new platform, take a look at open source Jupyter MCP Server[1]. Once integrated with Claude, it can execute code on the live notebook kernel.

I just let Claude write notebooks, run top to bottom, debug & fix errors & only ping me when everything is working.

[1] https://github.com/datalayer/jupyter-mcp-server

pplonski86 · 2026-05-04T08:08:34.000Z 1777882114

Thanks for sharing! MLJAR Studio was created for people with domain knowledge but not much technical expertise. For them, setting up a Python environment, installing required packages, configuring Jupyter Lab, the MCP server, and Claude Code might be technically demanding.

MLJAR Studio is a desktop application available for Windows, MacOS, and Linux. MLJAR Studio creates a Python environment for the user and installs all required packages. The user can focus on data rather than fighting technical challenges.

estetlinus · 2026-05-02T12:03:36.000Z 1777723416

This is one shot with Claude Code. What’s the moat?

2ndorderthought · 2026-05-02T13:51:55.000Z 1777729915

Not the op or affiliated but.

You really shouldn't and often cannot legally send off data or information about data to 3rd parties. Maybe schemas are okay but 1 mistake and your company can be in serious trouble. So local models is a good idea.

This is a safer workflow if implemented correctly to prevent certain types of mistakes when LLMs inevitably hallucinate or make a mistake.

That said, 200 usd? I don't believe the value is there. Someone can run a local model very easily, 1 command line call and do this themselves. For free.

arriemeijer · 2026-05-02T17:21:29.000Z 1777742489

My guess is you can't.

the best you can do is show them the code and hope they catch mistakes. Data scientists who can't read code probably shouldn't be running AI generated analysis on real data.

jiggunjer · 2026-05-02T14:13:05.000Z 1777731185

IME "real data work" doesn't involve notebooks.

msp26 · 2026-05-02T16:12:30.000Z 1777738350

I like starting most of my projects on marimo notebooks now and slowly moving parts of it to the main codebase + db.

By the end of it I might remove the notebook entirely but usually I keep it for some visualisation + running stuff as a cli tool.