The page has been translated by Gen AI.

CloudML based AI analysis environment configuration

CloudML based AI analysis environment configuration

Overview

Machine Learning (hereafter ML) is being increasingly applied in industry. In the past, data analysts performed all processes from model development/training to model deployment manually, but as the number of models being developed and operated increased, managing and operating the models required a lot of cost and effort. Therefore, a function that can automate management across the entire model lifecycle and operate it easily is needed.

CloudML is the name of a Samsung Cloud Platform product that provides modularized services based on Kubernetes for automated model lifecycle management. It describes the main module functions.

- Notebook: Provides a service that easily configures and uses the open-source Jupyter Notebook in the Samsung Cloud Platform environment.

- Studio: A no-code based ML model development platform that allows anyone to easily develop models by Drag and Drop in a visualized analysis environment, and view the development process with a visualized UI.

- Experiments: We provide a model training experiment management service based on the open-source MLFlow. In addition to experiment management, we also provide Model Registry and Model Verification features.

- Pipeline: You can automate and manage the entire process from model development to training. It allows flexible allocation of resources and images for each execution stage, and provides sequential execution and scheduling functions.

This document explains how to configure and use CloudML on the Samsung Cloud Platform.

Architecture Diagram

Diagram
Figure 1. Kubernetes-based CloudML environment configuration
  1. CloudML is deployed on Kubernetes. Users first create the Kubernetes Engine product, and at that time the Persistent Volume (PV) of the Kubernetes Cluster is created from the File Storage product.

  2. Configure a Kubernetes Cluster in the user’s VPC and deploy CloudML to the same Cluster. Once CloudML installation is complete, you can access the user View to use the features.

  3. In the Kubernetes Cluster, use the container image repository Container Registry to Pull/Push user Container Image.

  4. CloudML can be linked with Object Storage or Cloud Hadoop for using and storing analysis data sets and model files.

Infrastructure Configuration Example

App Node Pool(24-Core) must be configured, and the Node Pool for analysis can be configured with CPU or GPU according to the analysis environment.

For the Copilot service, an additional GPU Node Pool consisting of a 16‑core CPU or one GPU is required.

Configured with CPU Node Pool only

Diagram

Configured with CPU/GPU Node Pool (GPU environment setup for Copilot)

Architecture diagram

Use Cases

A. Construction of a visualized collaborative analysis system based on No-Code AI

CloudML provides a No-Code based visualized analysis environment. You can develop models easily even as a non-expert by arranging visualized functions with Drag & Drop. The developed model can be combined with Python code through Pipeline. Through this, professional analysts and non‑experts can collaborate and develop services.

B. Model Optimization Automation through Adoption of Analysis Platform

Pipeline provides analysis process automation capabilities. In the past, we performed analysis and simulation tasks manually based on Excel. However, by introducing CloudML and combining products, we can derive a model that shows optimal performance through automatic data I/F and analysis model simulation. Through automation conversion, we can reduce work time from several days to several hours.

Prerequisite items

CloudML A Kubernetes Cluster and File Storage that meet or exceed the minimum specifications provided by each product are required for installing each product.

Constraints

CloudML To integrate each product, it must be deployed on the same Kubernetes Worker Node.

Considerations

Data set, model utilization and storage can consider configuring the Object Storage service. To utilize custom images, consider configuring the Container Registry service.

Related Products

This is a list of Samsung Cloud Platform services that are related to the functions or configurations described in this guide. Please refer to it when selecting and designing services.

Service GroupServiceDetailed Description
StorageFile StorageStorage that shares files among multiple client servers through a network connection
StorageObject StorageObject storage that facilitates data storage and retrieval
ContainerKubernetes EngineKubernetes container orchestration service
ContainerContainer RegistryA service that easily stores, manages, and shares container images
NetworkingVPCA service that provides an independent virtual network in a cloud environment
NetworkingSecurity GroupVirtual firewall that controls server traffic
NetworkingLoad BalancerService that automatically distributes server traffic load
Table. List of related services