AWS + DIGITS + Ubuntu 14.04

Intro.

This blog post records how to deploy DIGITS on AWS for deep learning work.

AWS commands

SSH-related commands for connecting to AWS from a local machine.

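  • Connect to AWS:
  • Download the key file
  • Restrict the key file's permissions: chmod 400 xxxx-key-pair.pem
  • Connect over ssh: ssh -i xxxx-key-pair.pem username@ip-address
  • Upload and download files on AWS:

    Format (to download, swap the two paths so the remote path comes first):
    scp -i xxx-key-pair.pem path-of-file-to-upload username@ip-address:destination-path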

    scp -i /Users/junhao/Downloads/junhao_aws.pem " file path " ubuntu@ec2-34-201-250-52.compute-1.amazonaws.com:" target path "
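  • Linux: send files to a host listening on a non-default port, such as 65400 (the example below uses 8023)
    Note that -P is uppercase (lowercase -p preserves file times and modes instead)
    scp -P 8023 /var/www/html/* admin@ccsh.no-ip.tw:/home/data/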
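
ssh refuses a private key whose permissions are too open, which is why the chmod 400 step above matters. A minimal sketch of a check to run before connecting follows. The function names key_mode and ensure_key_mode are hypothetical helpers, not part of any AWS tooling, and the stat fallback assumes either GNU stat (Linux) or BSD stat (macOS).

```shell
#!/bin/sh
# Hypothetical helper: print the octal permission bits of a file.
key_mode() {
    # Try GNU stat (Linux) first, then fall back to BSD stat (macOS).
    stat -c '%a' "$1" 2>/dev/null || stat -f '%Lp' "$1"
}

# Hypothetical helper: tighten the key file to 400 if it is anything else.
ensure_key_mode() {
    if [ "$(key_mode "$1")" != "400" ]; then
        chmod 400 "$1"
    fi
}

# Example, using the placeholder file name from the list above:
#   ensure_key_mode xxxx-key-pair.pem
```

Running ensure_key_mode on a key that is already 400 changes nothing, so it should be safe to call before every connection.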


Setting up the environment

Check environment

sudo su # switch to root
lspci | grep -i nvidia # check for the GPU
uname -m && cat /etc/*release # check the OS
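
The GPU check above can be wrapped so that a script stops early on an instance without an NVIDIA device. This is only a sketch: has_nvidia_gpu is a hypothetical helper name, and it reads the lspci listing from standard input so it can be tried without a GPU present.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the lspci listing on stdin
# mentions an NVIDIA device (case-insensitive match).
has_nvidia_gpu() {
    grep -qi nvidia
}

# Example:
#   if lspci | has_nvidia_gpu; then
#       echo "NVIDIA GPU detected"
#   else
#       echo "No NVIDIA GPU detected; check the instance type"
#   fi
```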

Install Nvidia Driver
The original command was sudo apt-get install nvidia-375 nvidia-settings, but the driver failed to install that way, so nvidia-settings was dropped.

sudo apt-get update && sudo apt-get -y upgrade &&
sudo apt-get install -y linux-image-extra-`uname -r` &&
sudo add-apt-repository ppa:graphics-drivers/ppa &&
sudo apt-get update &&
sudo apt-get install nvidia-375
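
After the driver install finishes, nvidia-smi (shipped with the driver) is a common way to confirm that the driver can talk to the GPU. The sketch below assumes a reboot may be needed before the new kernel module loads, which the steps above do not state. check_driver is a hypothetical helper name.

```shell
#!/bin/sh
# Hypothetical helper: report whether the NVIDIA driver responds.
# Returns 0 only when nvidia-smi exists and runs successfully.
check_driver() {
    if ! command -v nvidia-smi >/dev/null 2>&1; then
        echo "nvidia-smi not found: the driver package is probably not installed"
        return 1
    fi
    if nvidia-smi >/dev/null 2>&1; then
        echo "driver loaded: nvidia-smi can talk to the GPU"
    else
        echo "nvidia-smi failed: try rebooting, then run it again"
        return 1
    fi
}

# Example:
#   check_driver
```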

For Ubuntu 14.04, install the repo packages

CUDA_REPO_PKG=cuda-repo-ubuntu1404_7.5-18_amd64.deb &&
wget http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1404/x86_64/$CUDA_REPO_PKG &&
sudo dpkg -i $CUDA_REPO_PKG
ML_REPO_PKG=nvidia-machine-learning-repo-ubuntu1404_4.0-2_amd64.deb &&
wget http://developer.download.nvidia.com/compute/machine-learning/repos/ubuntu1404/x86_64/$ML_REPO_PKG &&
sudo dpkg -i $ML_REPO_PKG

Install DIGITS

sudo apt-get update
sudo apt-get install digits
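
Once the package is installed, a quick HTTP request can show whether the DIGITS web server is answering. This is a sketch under an assumption: that the package starts the server on port 80 of the instance, which is not stated above, so adjust the URL if your setup differs. digits_reachable is a hypothetical helper name; the curl flags are standard.

```shell
#!/bin/sh
# Hypothetical helper: succeed if an HTTP server answers at the given URL.
digits_reachable() {
    # -s silent, -f fail on HTTP errors, -o discard the response body,
    # --max-time caps the whole request at 5 seconds.
    curl -sf -o /dev/null --max-time 5 "$1"
}

# Example (the URL is an assumption, see the note above):
#   digits_reachable http://localhost/ && echo "DIGITS is answering"
```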