Skip to main content
  1. Blog
  2. Article

Hasmik Zmoyan
on 6 November 2023


Welcome to the 7th episode of Ubuntu AI podcast. Together with industry experts we’re discussing the topic of the year: AI.

From fun experiments to enterprise projects, AI became the center of attention when it comes to innovation, digital transformation and optimization.

Open source technologies democratized access to state-of-the-art machine learning tools and open doors for everyone . In this episode we have a special guest, together with whom we are discussing how the data in AI is being used and how you can make most of your data within organizations.

Open Source in modern software world.

For software, we’re at the point where it’s very clear that open source is one of the main pillars, if not the main pillar that the software world has been built on.

It has enabled crazy growth and tooling and enabled options for companies that are working in AI space.

We can see, that similar trends are already seen in Machine Learning. Even for LLMs and large cutting-edge models those are going to be based on open source: maybe customized, tailored in other ways for company needs.

Is there any movement in getting more open source data into universities and research institutions?

In world of software development the source is your code, in world of data and machine learning the source is code + data and data presents a lot of unique challenges.

One is the fact, that if you’re using data coming from real people and you might have PII (Personality identifying information) there will be a lot of sensitivity in that data. Companies are not willing to share it, even though they use open source, being considered as a business advantage.

But we are seeing a lot of open source data sets that are being released …

Should the data be static?

When you want to get to production, you realize that data is actually not static – it’s dynamic. You get data coming in all the time and that’s what helps you to improve your model over time.

Another thing that helps is when you;re subscribed to old data center AI. There are a lot of open source data sets. They are not dynamic enough yet, but there are told that enables community members to actually enable data.

Where to learn more about AI?

First, make sure to subscribe to our bi-weekly podcasts, where we are discussing AI. Listen to our podcasts on Spotify or Apple Podcasts.

If you want to learn more about the AI solutions we provide check our website and feel free to get in touch via contact forms or live chats here.

Related posts


Canonical
5 January 2026

Canonical announces Ubuntu support for the NVIDIA Rubin platform

Canonical announcements Article

Official Ubuntu support for the NVIDIA Rubin platform, including the NVIDIA Vera Rubin NVL72 rack-scale systems, announced at CES 2026 CES 2026, Las Vegas. – Canonical, the publisher of Ubuntu, is pleased to announce official support for the NVIDIA Rubin platform and the latest distributions of the new NVIDIA Nemotron 3 open models.  As A ...


Canonical
16 March 2026

Canonical announces it will distribute NVIDIA DOCA-OFED in Ubuntu

AI Article

Today Canonical, the publishers of Ubuntu, announced that it will integrate and distribute the NVIDIA DOCA-OFED networking driver with Ubuntu. ...


Canonical
16 March 2026

Meet Canonical at NVIDIA GTC 2026: NVIDIA CUDA and NVIDIA Vera Rubin NVL72 support in Ubuntu 26.04 LTS

Ubuntu Article

Previewing at NVIDIA GTC 2026: NVIDIA CUDA support in Ubuntu 26.04 LTS, NVIDIA Vera Rubin NVL72 architecture support in Ubuntu 26.04, Canonical’s official Ubuntu image for NVIDIA Jetson Thor, upcoming support for NVIDIA DGX Station and NVIDIA DOCA-OFED, and NVIDIA RTX PRO 4500 support. NVIDIA GTC 2026 is here, bringing together the techno ...