For example, a company may concatenate several daily reports into one monthly report at the end of a month. When you concatenate two or more PDFs, you join the files one after another into a single document.
#PYTHON 3 PDFWRITER XTOPUS PDF#
Two common tasks when working with PDF files are concatenating and merging several PDFs into a single file. When you’re ready, you can move on to the next section. open ( mode = "wb" ) as output_file : pdf_writer. home () / "last_page.pdf" with output_path.
#PYTHON 3 PDFWRITER XTOPUS FULL#
When you save and run the program, it will create a new file in your home directory called Pride_and_Prejudice.txt containing the full text of the Pride_and_Prejudice.pdf document. The text from each page is extracted with page.extractText() and is written to the output_file. At each step in the loop, the next PageObject is assigned to the page variable. Then, inside the with block, you write the PDF title and number of pages to the text file using output_file.write().įinally, you use a for loop to iterate over all the pages in the PDF. The with statement, which you learned about in chapter 12, “File Input and Output,” ensures that the file is closed when the with block exits. Next, you open output_file_path in write mode and assign the file object returned by. You also create a new Path object that points to the file Pride_and_Prejudice.txt in your home directory and assign it to the output_file_path variable. write ( text )įirst, you assign a new PdfFileReader instance to the pdf_reader variable. write ( f " \\ n \\ n" ) # 4 for page in pdf_reader. open ( mode = "w" ) as output_file : # 3 title = pdf_reader. home () / "Pride_and_Prejudice.txt" # 2 with output_file_path. home () / "creating-and-modifying-pdfs" / "practice-files" / "Pride_and_Prejudice.pdf" ) # 1 pdf_reader = PdfFileReader ( str ( pdf_path )) output_file_path = Path. pages attribute that you can use to iterate over all of the pages in the PDF in order.įor example, the following for loop prints the text from every page in the Pride and Prejudice PDF:įrom pathlib import Path from PyPDF2 import PdfFileReader # Change the path below to the correct path for your computer. The output you see on your computer may be formatted differently.Įvery PdfFileReader object has a. Note that the output displayed here has been formatted to fit better on this page. Produced by Anonymous Volunteers, and David Widger *** START OF THIS PROJECT GUTENBERG EBOOK PRIDE AND PREJUDICE *** Use it under the terms of the Project Gutenberg License included This eBook is for the use of anyone anywhere at no cost and withĪlmost no restrictions whatsoever. The Project Gutenberg EBook of Pride and Prejudice, by Jane Austen