前言

有沒有發現一個生活中的現象，我們很少有時間去真正讀一些存放在電腦或者ipad上的pdf書籍。我們打算讀這些書，但從來沒有讀過。所以我們為什麼不用Python把它們做成有聲書，一邊聽一邊做別的事情呢？

我們計劃Python腳本步驟是這樣的：

允許用戶選擇讀取一個.pdf文件
將文件內容轉換為一個字符串
輸出的mp3音頻文件

允許用戶選擇讀取一個.pdf文件

Python可以輕鬆地讀取文件。

我只需要使用open(「filelocation」，「rb」)在讀取模式下打開文件。但我不想每次使用代碼時都要將文件複製並粘貼到代碼目錄中。因此，為了使它更容易，我們將使用tkinter庫來打開一個讓我們選擇文件的接口：

from tkinter import Tk
from tkinter.filedialog import askopenfilename

Tk().withdraw() # we don't want a full GUI, so keep the root window from appearing
filelocation = askopenfilename() # open the dialog GUI

太好了。現在，我們將文件位置存儲在filelocation變量中。

將文件轉換為一個字符串

如前所述，要在Python中打開文件，我們只需要使用open()方法。但是我們還希望將pdf文件轉換為常規文本。

為此，我們將使用一個名為pdftotext的庫。

先安裝:

sudo pip install pdftotext

然後：

from tkinter import Tk
from tkinter.filedialog import askopenfilename
import pdftotext

Tk().withdraw() # we don't want a full GUI, so keep the root window from appearing
filelocation = askopenfilename() # open the dialog GUI

with open(filelocation, "rb") as f:  # open the file in reading (rb) mode and call it f
    pdf = pdftotext.PDF(f)  # store a text version of the pdf file f in pdf variable

如果您列印這個變量，您將得到一個字符串數組。每個字符串都是文件中的一行。要將它們全部存儲到一個.mp3文件中，我們必須確保它們都存儲為一個字符串。讓我們循環這個數組並將它們全部添加到一個字符串中：

from tkinter import Tk
from tkinter.filedialog import askopenfilename
import pdftotext

Tk().withdraw() # we don't want a full GUI, so keep the root window from appearing
filelocation = askopenfilename() # open the dialog GUI

with open(filelocation, "rb") as f:  # open the file in reading (rb) mode and call it f
    pdf = pdftotext.PDF(f)  # store a text version of the pdf file f in pdf variable

string_of_text = ''
for text in pdf:
    string_of_text += text

輸出.mp3文件

現在，我們準備使用gTTS(谷歌文本到語音)庫。我們所需要做的就是傳遞我們創建的字符串，將輸出存儲在一個變量中，然後使用save()方法將文件輸出到計算機。

先安裝：

sudo pip install gtts

然後：

from tkinter import Tk
from tkinter.filedialog import askopenfilename
import pdftotext
from gtts import gTTS

Tk().withdraw() # we don't want a full GUI, so keep the root window from appearing
filelocation = askopenfilename() # open the dialog GUI

with open(filelocation, "rb") as f:  # open the file in reading (rb) mode and call it f
    pdf = pdftotext.PDF(f)  # store a text version of the pdf file f in pdf variable

string_of_text = ''
for text in pdf:
    string_of_text += text

final_file = gTTS(text=string_of_text, lang='en')  # store file in variable
final_file.save("Generated Speech.mp3")  # save file to computer

就這麼簡單!快去拿你的pdf去嘗試吧。

參考：

https://dev.to/mustafaanaskh99/convert-any-pdf-file-into-an-audio-book-with-python-1gk4

利用Python將. pdf電子書籍📚轉換成音頻有聲讀物🔈

前言

允許用戶選擇讀取一個.pdf文件

將文件轉換為一個字符串

輸出.mp3文件

你應該知道的10個Python文件系統方法

微軟考慮將Python作為官方腳本語言添加到Excel中？

利用深度學習技術檢測x射線圖像中的COVID-19

利用Python將. pdf電子書籍📚轉換成音頻有聲讀物🔈

量子機器學習:人工智慧的未來

為什麼我們的機器學習平台支持Python而不是R

理解Python中的下劃線(_)的五種用法

如何構建一個問答聊天機器，來自史丹福大學課程研究成果的演示

如何在Kaggle上打比賽，帶你進行一次完整流程體驗

Python快速實戰機器學習之數據預處理

盤點那些鮮為人知卻非常實用的Python數據科學庫

蘋果Siri | 亞馬遜Alexa | 谷歌Assistant | 誰最聰明？

史上最全的Python大集合：awesome-python

帶你了解一下神奇的Github

關於Tensorflow深度學習書籍調研

想在2020學習Python，這裡有8個不可以錯過的免費教程和網站

年度盤點，30個開創性的Python開源項目-你都用過哪些？

為什麼Python這麼慢？

可視化神器Seaborn的超全介紹

Jupyter Notebook 使用手冊

如何爬取知乎漂亮妹紙照片第二彈

盤點2018年30個令人驚嘆的Github上Python項目

Python新手入門指引

NumPy最生動形象的教程