溫馨提示×

溫馨提示×

您好，登錄后才能下訂單哦！

密碼登錄×

忘記密碼？

登錄注冊×

獲取短信驗證碼

其他方式登錄

點擊登錄注冊即表示同意《億速云用戶服務條款》

用戶登錄×

賬戶密碼登錄

請使用微信掃描上方二維碼

使用幫助

請求超時！

請點擊重新獲取二維碼

怎么用Tensorflow完成手寫數字識別

發布時間：2021-12-21 17:10:36 來源：億速云閱讀：319 作者：柒染欄目：大數據

# 怎么用TensorFlow完成手寫數字識別

## 一、項目概述

手寫數字識別是計算機視覺和機器學習領域的經典入門項目。本文將通過TensorFlow 2.x實現一個基于MNIST數據集的卷積神經網絡（CNN）模型，完整展示從數據預處理到模型部署的全流程。

## 二、環境準備

首先確保安裝以下環境：
```python
pip install tensorflow==2.10
pip install matplotlib numpy

三、數據加載與預處理

1. 加載MNIST數據集

TensorFlow內置了MNIST數據集：

import tensorflow as tf
from tensorflow.keras.datasets import mnist

# 加載數據
(x_train, y_train), (x_test, y_test) = mnist.load_data()

2. 數據預處理

# 歸一化到0-1范圍
x_train = x_train / 255.0
x_test = x_test / 255.0

# 添加通道維度（CNN需要）
x_train = x_train[..., tf.newaxis]
x_test = x_test[..., tf.newaxis]

# One-hot編碼標簽
y_train = tf.keras.utils.to_categorical(y_train, 10)
y_test = tf.keras.utils.to_categorical(y_test, 10)

四、構建CNN模型

1. 模型架構

model = tf.keras.Sequential([
    # 卷積層1
    tf.keras.layers.Conv2D(32, (3,3), activation='relu', input_shape=(28,28,1)),
    tf.keras.layers.MaxPooling2D((2,2)),
    
    # 卷積層2
    tf.keras.layers.Conv2D(64, (3,3), activation='relu'),
    tf.keras.layers.MaxPooling2D((2,2)),
    
    # 全連接層
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(10, activation='softmax')
])

2. 模型編譯

model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              metrics=['accuracy'])

五、模型訓練與評估

1. 訓練模型

history = model.fit(x_train, y_train, 
                    batch_size=128,
                    epochs=10,
                    validation_split=0.1)

2. 評估模型

test_loss, test_acc = model.evaluate(x_test, y_test)
print(f"Test accuracy: {test_acc:.4f}")

六、可視化結果

1. 訓練過程可視化

import matplotlib.pyplot as plt

plt.plot(history.history['accuracy'], label='accuracy')
plt.plot(history.history['val_accuracy'], label='val_accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.legend()
plt.show()

2. 預測示例

import numpy as np

# 隨機選擇測試樣本
idx = np.random.randint(0, x_test.shape[0])
sample = x_test[idx].reshape(1,28,28,1)

# 預測
pred = model.predict(sample)
pred_num = np.argmax(pred)

# 顯示圖像
plt.imshow(x_test[idx].reshape(28,28), cmap='gray')
plt.title(f"Predicted: {pred_num}")
plt.show()

七、模型優化技巧

數據增強：

data_augmentation = tf.keras.Sequential([
    tf.keras.layers.RandomRotation(0.1),
    tf.keras.layers.RandomZoom(0.1)
])

學習率調度：

lr_scheduler = tf.keras.callbacks.LearningRateScheduler(
    lambda epoch: 1e-3 * 10**(epoch/20))

早停機制：

early_stopping = tf.keras.callbacks.EarlyStopping(
    monitor='val_loss', patience=3)

八、模型保存與部署

1. 保存模型

model.save('mnist_cnn.h5')  # HDF5格式

2. 加載模型

new_model = tf.keras.models.load_model('mnist_cnn.h5')

3. 轉換為TensorFlow Lite（移動端部署）

converter = tf.lite.TFLiteConverter.from_keras_model(model)
tflite_model = converter.convert()
with open('mnist.tflite', 'wb') as f:
    f.write(tflite_model)

九、完整代碼示例

# 完整代碼整合（省略部分可視化代碼）
import tensorflow as tf
from tensorflow.keras.datasets import mnist

# 數據加載與預處理
(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0
x_train = x_train[..., tf.newaxis]
x_test = x_test[..., tf.newaxis]
y_train = tf.keras.utils.to_categorical(y_train, 10)
y_test = tf.keras.utils.to_categorical(y_test, 10)

# 構建模型
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, (3,3), activation='relu', input_shape=(28,28,1)),
    tf.keras.layers.MaxPooling2D((2,2)),
    tf.keras.layers.Conv2D(64, (3,3), activation='relu'),
    tf.keras.layers.MaxPooling2D((2,2)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(10, activation='softmax')
])

# 編譯與訓練
model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              metrics=['accuracy'])
model.fit(x_train, y_train, epochs=10, validation_split=0.1)

# 評估
test_loss, test_acc = model.evaluate(x_test, y_test)
print(f"Test accuracy: {test_acc:.4f}")

十、總結

通過本教程，我們實現了： 1. 使用TensorFlow加載和預處理MNIST數據集 2. 構建了一個簡單的CNN模型 3. 完成了模型訓練和評估 4. 實現了模型保存與轉換

該模型準確率可達99%以上，可作為其他計算機視覺任務的基礎模板。 “`

向AI問一下細節

推薦閱讀：

免責聲明：本站發布的內容（圖片、視頻和文字）以原創、轉載和分享為主，文章觀點不代表本網站立場，如果涉及侵權請聯系站長郵箱：is@yisu.com進行舉報，并提供相關證據，一經查實，將立刻刪除涉嫌侵權內容。

上一篇新聞：
Spring中Bean的作用域與生命周期是什么
下一篇新聞：
SDN和NFV的基礎以及Intel的思考是怎樣的

猜你喜歡

AI
助
手

產品服務

地區劃分

專題活動

幫助支持

關于我們

售后咨詢

7*24小時在線電話：400-100-2938

7*24小時在線 QQ：800811969

關注億速云

億速云公眾號

手機網站二維碼

亚洲午夜精品一区二区_中文无码日韩欧免_久久香蕉精品视频_欧美主播一区二区三区美女