JetBrains Launches Mellum: An Open AI Model for Code Completion

JetBrains, the brains behind some of the most beloved developer tools out there, has thrown its hat into the open AI ring with Mellum—a code-generating model that’s got developers talking. 😊 With a diet of over 4 trillion tokens and a hefty 4 billion parameters, Mellum isn’t just smart; it’s like that one friend who finishes your sentences, but for code. Imagine typing away and having it suggest what comes next, making those long coding sessions a tad less grueling.

Now, here’s the scoop on how Mellum got so sharp: a mix of GitHub code (the kind that plays nice with licenses) and English Wikipedia articles, all chewed through in roughly 20 days on a beastly setup of 256 Nvidia H200 GPUs. But don’t get too excited—Mellum isn’t quite ready to take the wheel. It needs some fine-tuning, and while JetBrains has tossed out a few Python-tuned versions, they’re more like appetizers than the main course for your production needs.

Let’s talk about the elephant in the room: security. AI-generated code can sometimes be a wild card, and Mellum’s no exception. JetBrains is upfront about it—this model might pick up a few bad habits from public codebases, meaning it won’t always give you the cleanest, safest suggestions. It’s a nifty tool, sure, but it’s not about to replace your good old-fashioned code reviews.

In the end, JetBrains is pitching Mellum as more of a conversation starter than the final word. It’s all about getting the ball rolling on innovation and teamwork in the AI coding world. Fancy taking it for a spin? Mellum’s hanging out on Hugging Face, ready for your next coding adventure.

Related news