Python断点续传下载

断点续传是指下载文件的过程中出现中断，重新下载时从中断处继续下载，而不是重新开始下载整个文件。Python提供了一些库和方法来实现断点续传下载，本文将从多个方面介绍Python断点续传下载的实现方法。

一、使用requests库实现断点续传下载

requests是Python中常用的网络请求库，可以很方便地发送HTTP请求并获取相应内容。通过设置请求头中的Range字段，可以指定下载文件的起始位置，实现断点续传下载。

以下是使用requests库实现断点续传下载的示例代码：

import requests

url = 'http://example.com/file.txt'
file_path = 'download/file.txt'
headers = {}

def download_file(url, file_path, headers):
    response = requests.get(url, headers=headers, stream=True)
    total_size = int(response.headers.get('Content-Length', 0))
    downloaded_size = 0

    with open(file_path, 'ab') as file:
        for chunk in response.iter_content(chunk_size=8192):
            if chunk:
                file.write(chunk)
                file.flush()
                downloaded_size += len(chunk)
                print(f'Downloaded {downloaded_size}/{total_size}', end='r')

    print('Download complete.')

if __name__ == '__main__':
    download_file(url, file_path, headers)

上述代码中，我们通过requests.get()方法发送HTTP请求，并设置stream=True参数来启用流式下载。在下载过程中，我们使用iter_content()方法迭代下载的数据块，将其写入本地文件，并同时打印下载进度。

二、使用urllib库实现断点续传下载

除了使用requests库外，还可以使用Python内置的urllib库来实现断点续传下载。通过设置Range请求头，可以指定下载的起始位置和结束位置。

以下是使用urllib库实现断点续传下载的示例代码：

from urllib import request

url = 'http://example.com/file.txt'
file_path = 'download/file.txt'
headers = {'Range': 'bytes=0-'}

def download_file(url, file_path, headers):
    req = request.Request(url, headers=headers)
    response = request.urlopen(req)
    total_size = int(response.headers.get('Content-Length', 0))
    downloaded_size = 0

    with open(file_path, 'ab') as file:
        while True:
            chunk = response.read(8192)
            if not chunk:
                break

            file.write(chunk)
            file.flush()
            downloaded_size += len(chunk)
            print(f'Downloaded {downloaded_size}/{total_size}', end='r')

    print('Download complete.')

if __name__ == '__main__':
    download_file(url, file_path, headers)

上述代码中，我们通过urllib.request.urlopen()方法发送HTTP请求，并设置Range请求头。在下载过程中，我们使用read()方法读取数据，并将其写入本地文件，同时打印下载进度。

三、使用wget库实现断点续传下载

除了使用requests和urllib库外，还可以使用第三方库wget来实现断点续传下载。wget是一个命令行工具，也可以通过Python来使用它。

以下是使用wget库实现断点续传下载的示例代码：

import wget

url = 'http://example.com/file.txt'
file_path = 'download/file.txt'

def download_file(url, file_path):
    wget.download(url, out=file_path)

if __name__ == '__main__':
    download_file(url, file_path)

上述代码中，我们使用wget.download()方法来下载文件，并指定保存文件的路径。

四、总结

本文介绍了使用requests、urllib和wget三种方式实现Python断点续传下载的方法。通过设置请求头中的Range字段，可以指定下载文件的起始位置，实现断点续传下载。选择合适的方法取决于个人的实际需求和偏好。