손글씨 숫자 인식 TensorFlow와 PyTorch를 활용한 신경망 구현

728x90

손글씨 숫자 인식: TensorFlow와 PyTorch를 활용한 신경망 구현

Overview

손글씨 숫자 인식을 위해 TensorFlow와 PyTorch를 사용하여 신경망을 구현하는 방법을 살펴보겠습니다. 이번 설명에서는 MNIST 데이터셋을 사용하여 신경망 모델을 구축하고 학습시키는 과정에 대해 자세히 다룰 것입니다. 두 가지 프레임워크 모두에 대해 단계별로 설명하되, 하나의 프레임워크를 깊이 있게 다룰 것입니다.

1. TensorFlow를 사용한 손글씨 숫자 인식

TensorFlow는 Google이 개발한 오픈소스 머신러닝 프레임워크입니다. 손글씨 숫자 인식 문제를 해결하기 위해, TensorFlow의 Keras API를 활용하여 신경망을 쉽게 구현할 수 있습니다. 아래는 TensorFlow를 사용하여 MNIST 데이터셋을 학습시키는 단계별 가이드입니다.

1.1. 환경 설정

우선 TensorFlow와 필요한 라이브러리를 설치합니다. 터미널에서 다음 명령어를 실행하여 설치합니다:

pip install tensorflow

1.2. 데이터셋 로딩

MNIST 데이터셋은 TensorFlow에서 직접 로드할 수 있습니다. 데이터셋은 손글씨 숫자 이미지와 라벨을 포함합니다.

import tensorflow as tf
from tensorflow.keras.datasets import mnist

# 데이터셋 로드
(x_train, y_train), (x_test, y_test) = mnist.load_data()

# 데이터 전처리
x_train, x_test = x_train / 255.0, x_test / 255.0  # 정규화

1.3. 모델 정의

모델은 여러 층을 포함한 신경망입니다. 가장 간단한 형태는 Sequential API를 사용하는 것입니다.

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Flatten

model = Sequential([
Flatten(input_shape=(28, 28)),  # 입력 데이터 형태 (28x28 이미지)
Dense(128, activation='relu'),  # 은닉층
Dense(10, activation='softmax')  # 출력층
])

1.4. 모델 컴파일

모델을 학습시키기 전에 컴파일 과정을 거칩니다. 이 과정에서는 손실 함수와 옵티마이저를 설정합니다.

model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

1.5. 모델 학습

이제 모델을 학습시킵니다. 학습 데이터와 검증 데이터를 제공하여 모델을 훈련시킵니다.

model.fit(x_train, y_train, epochs=5, validation_split=0.2)

1.6. 모델 평가

학습이 완료된 후, 테스트 데이터셋을 사용하여 모델의 성능을 평가합니다.

test_loss, test_acc = model.evaluate(x_test, y_test)
print(f"Test accuracy: {test_acc}")

2. PyTorch를 사용한 손글씨 숫자 인식

PyTorch는 Facebook에서 개발한 오픈소스 머신러닝 프레임워크로, 동적 계산 그래프를 지원하여 보다 유연한 모델을 구현할 수 있습니다. PyTorch를 사용하여 MNIST 데이터셋을 학습시키는 방법을 살펴보겠습니다.

2.1. 환경 설정

PyTorch와 필요한 라이브러리를 설치합니다. 다음 명령어를 실행하여 설치합니다:

pip install torch torchvision

2.2. 데이터셋 로딩

PyTorch에서는 torchvision 라이브러리를 사용하여 데이터셋을 로드합니다.

import torch
import torchvision
import torchvision.transforms as transforms

# 데이터 변환 정의
transform = transforms.Compose([transforms.ToTensor(), transforms.Normalize((0.5,), (0.5,))])

# 데이터셋 로드
trainset = torchvision.datasets.MNIST(root='./data', train=True, download=True, transform=transform)
trainloader = torch.utils.data.DataLoader(trainset, batch_size=4, shuffle=True)

testset = torchvision.datasets.MNIST(root='./data', train=False, download=True, transform=transform)
testloader = torch.utils.data.DataLoader(testset, batch_size=4, shuffle=False)

2.3. 모델 정의

PyTorch에서는 torch.nn.Module을 상속받아 모델을 정의합니다.

import torch.nn as nn
import torch.optim as optim

class Net(nn.Module):
def __init__(self):
super(Net, self).__init__()
self.fc1 = nn.Linear(28*28, 128)
self.fc2 = nn.Linear(128, 10)

def forward(self, x):
x = x.view(-1, 28*28)  # Flatten
x = torch.relu(self.fc1(x))
x = self.fc2(x)
return x

net = Net()

2.4. 손실 함수와 옵티마이저 설정

모델을 학습시키기 위해 손실 함수와 옵티마이저를 설정합니다.

criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(net.parameters(), lr=0.001)

2.5. 모델 학습

학습 루프를 설정하여 모델을 학습시킵니다.

for epoch in range(5):  # 에폭 수
running_loss = 0.0
for inputs, labels in trainloader:
optimizer.zero_grad()
outputs = net(inputs)
loss = criterion(outputs, labels)
loss.backward()
optimizer.step()
running_loss += loss.item()
print(f"Epoch {epoch+1}, Loss: {running_loss/len(trainloader)}")

2.6. 모델 평가

테스트 데이터셋을 사용하여 모델을 평가합니다.

correct = 0
total = 0
with torch.no_grad():
for images, labels in testloader:
outputs = net(images)
_, predicted = torch.max(outputs.data, 1)
total += labels.size(0)
correct += (predicted == labels).sum().item()

print(f'Accuracy of the network on the 10000 test images: {100 * correct / total}%')

참고문서