
    DL_fan's blog: Load balancing with nginx to expose multi-machine, multi-GPU services

    Posted: 2021-07-10 22:26

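    Idea: several GPU service endpoints --> nginx load balancing --> a single endpoint exposed to callers.

    The example below uses one machine with two GPUs. Deploying a multi-process service on a single GPU with gunicorn is covered in a separate article.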
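
    As a rough illustration of the idea above, the following Python sketch models how requests arriving at one entry point are spread over several backends. It mirrors nginx's default round-robin method with equal weights, but it is only a simplified model: it ignores weights, failed servers and retries. The two backend addresses are the GPU services used later in this post; the function names are made up for this sketch.

```python
from itertools import cycle

# Backend addresses assumed from the example configuration in this post.
BACKENDS = ["192.168.102.200:10009", "192.168.102.200:10010"]


def round_robin(backends):
    """Return an endless iterator that yields the backends in turn."""
    return cycle(backends)


def dispatch(n_requests, backends=BACKENDS):
    """Return the backend chosen for each of n_requests consecutive requests."""
    picker = round_robin(backends)
    return [next(picker) for _ in range(n_requests)]


if __name__ == "__main__":
    # Print which backend each of four consecutive requests would reach.
    for i, backend in enumerate(dispatch(4), start=1):
        print(f"request {i} -> {backend}")
```

    With two backends, consecutive requests simply alternate between the two GPU services.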

    I. Build the nginx load-balancing image

    1. Write the Dockerfile

    FROM nginx:1.13.3
    COPY ./ /
    RUN mkdir /app
    COPY /nginx.conf /etc/nginx/nginx.conf

    2. The full nginx.conf

    
    #user  nobody;
    worker_processes  1;
    
    #error_log  logs/error.log;
    #error_log  logs/error.log  notice;
    #error_log  logs/error.log  info;
    
    #pid        logs/nginx.pid;
    
    
    events {
        worker_connections  1024;
    }
    
    
    http {
        include       mime.types;
        default_type  application/octet-stream;
    
        #log_format  main  '$remote_addr - $remote_user [$time_local] "$request" '
        #                  '$status $body_bytes_sent "$http_referer" '
        #                  '"$http_user_agent" "$http_x_forwarded_for"';
    
        #access_log  logs/access.log  main;
    
        sendfile        on;
        #tcp_nopush     on;
    
        #keepalive_timeout  0;
        keepalive_timeout  65;
    
        #gzip  on;
    
    	# bx: pool of GPU services; nginx balances across them round-robin by default
    	upstream algoserver{
    	    server 192.168.102.200:10009;
    	    server 192.168.102.200:10010;
    	}
    	
        server {
            listen       8082;
            server_name  localhost;
    
            #charset koi8-r;
    
            #access_log  logs/host.access.log  main;
    
            location / {
                #root   html;
                #index  index.html index.htm;
                # bx: forward every request to the algoserver upstream pool
                proxy_pass http://algoserver;
                proxy_set_header Host $host;
            }
    
            #error_page  404              /404.html;
    
            # redirect server error pages to the static page /50x.html
            #
            error_page   500 502 503 504  /50x.html;
            location = /50x.html {
                root   html;
            }
    
            # proxy the PHP scripts to Apache listening on 127.0.0.1:80
            #
            #location ~ \.php$ {
            #    proxy_pass   http://127.0.0.1;
            #}
    
            # pass the PHP scripts to FastCGI server listening on 127.0.0.1:9000
            #
            #location ~ \.php$ {
            #    root           html;
            #    fastcgi_pass   127.0.0.1:9000;
            #    fastcgi_index  index.php;
            #    fastcgi_param  SCRIPT_FILENAME  /scripts$fastcgi_script_name;
            #    include        fastcgi_params;
            #}
    
            # deny access to .htaccess files, if Apache's document root
            # concurs with nginx's one
            #
            #location ~ /\.ht {
            #    deny  all;
            #}
        }
    
    
        # another virtual host using mix of IP-, name-, and port-based configuration
        #
        #server {
        #    listen       8000;
        #    listen       somename:8080;
        #    server_name  somename  alias  another.alias;
    
        #    location / {
        #        root   html;
        #        index  index.html index.htm;
        #    }
        #}
    
    
        # HTTPS server
        #
        #server {
        #    listen       443 ssl;
        #    server_name  localhost;
    
        #    ssl_certificate      cert.pem;
        #    ssl_certificate_key  cert.key;
    
        #    ssl_session_cache    shared:SSL:1m;
        #    ssl_session_timeout  5m;
    
        #    ssl_ciphers  HIGH:!aNULL:!MD5;
        #    ssl_prefer_server_ciphers  on;
    
        #    location / {
        #        root   html;
        #        index  index.html index.htm;
        #    }
        #}
    
    }

    Here the upstream entries
        server 192.168.102.200:10009;
        server 192.168.102.200:10010;

    are the two services started on the GPUs. nginx now exposes both of them behind the single address 192.168.102.200:8082.
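
    With more machines or more GPUs, the upstream block grows by one server line per service. The sketch below is one possible way to generate that block from a list of (host, port) pairs instead of editing it by hand. The helper name render_upstream is hypothetical; the upstream name and addresses are taken from the configuration above.

```python
def render_upstream(name, backends):
    """Render an nginx upstream block with one server line per (host, port) pair."""
    if not backends:
        raise ValueError("an upstream block needs at least one server")
    lines = [f"upstream {name} {{"]
    lines += [f"    server {host}:{port};" for host, port in backends]
    lines.append("}")
    return "\n".join(lines)


if __name__ == "__main__":
    # The two GPU services from this post; services on other machines
    # would be appended to this list.
    gpu_services = [("192.168.102.200", 10009), ("192.168.102.200", 10010)]
    print(render_upstream("algoserver", gpu_services))
```

    The printed text can be pasted into the http section of nginx.conf in place of the existing upstream block.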

    3. Build the image

    docker build -t nginx/express:0.1 .

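    II. Start the container to do the load balancing

    Port 8082 from the configuration above is published on the host as port 10016, so users can reach the GPU services on 10009 and 10010 through port 10016.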

    docker run -it -p 10016:8082 -v /home/fanzonghao/red_detection/software/nginx.conf:/etc/nginx/nginx.conf nginx/express:0.1
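
    Once the container is running, a client only needs the published port. The following is a minimal client sketch using the Python standard library. It assumes the container runs on the same host as in this post (192.168.102.200), and that the GPU services accept a JSON POST at a path such as /predict; that path and the payload format are hypothetical and must be replaced with whatever the real services expose.

```python
import json
import urllib.request

# Assumptions: the host IP used in this post and the published port 10016.
LB_HOST = "192.168.102.200"
LB_PORT = 10016


def build_url(path, host=LB_HOST, port=LB_PORT):
    """Build the URL of the load-balanced endpoint for a given path."""
    return f"http://{host}:{port}/{path.lstrip('/')}"


def call_service(payload, path="/predict", timeout=10.0):
    """POST a JSON payload through nginx and return the decoded JSON reply.

    The default path "/predict" is a placeholder, not the real service API.
    """
    request = urllib.request.Request(
        build_url(path),
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

    Calling call_service({"image_path": "example.jpg"}) repeatedly would send each request to port 10016, and nginx would decide which of the two GPU services handles it.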
