PDF.co API offers a lot of file manipulation and conversion features. You can easily convert your PDF files to CSV, XLS, XLSX, JSON, XML, or Text in a matter of seconds. It doesn’t matter even if your PDF file contains any scanned images, PDF.co API extracts text from unstructured documents and images using built-in AI-Powered OCR. No code automation with Zapier, Integromat, Make (more!) is available.
Let us explore the following headings to learn PDF conversion:
Why PDF.co API
- Security: PDF.co API runs on the secure and certified Amazon AWS infrastructure. All data transfers are encrypted by SSL/TLS encryption. See the security page for more details. The On-prem version can run on any hosting provider and cloud storage of your choice.
- Support for Large Files You can process large files and documents with hundreds of pages in the cloud using async mode.
- Battle-tested by thousands of production users. Our engines are tested in production by thousands of enterprise users.
- Pay-as-you-go system. You can go with a subscription or use pre-paid credits that can be spent per page. You can also go with a subscription to get more credits at the same price compared to one-time credits. For details please explore your API logs.
- On-Prem API Server and On-Prem SDK are available. Enterprise users may obtain ByteScout API Server for use on their self-hosted server (Windows Server). For more information on the On-Prem version please read here.
- OCR support. AI-powered OCR for increased accuracy and error handling.
PDF Extractor API Supports Multiple Languages
PDF.co platform can be used by software developers from programming languages such as PHP, Javascript, .NET and ASP.NET, C#, Java, Visual Basic, and many others. Find source code samples in our API documentation.
Business Automation Platforms Integrations
If you are not a developer, you can also easily automate your PDF operations via popular business automation platforms: Zapier, Make, Airtable, Salesforce, Google Apps Script, and 300+ more.
Enterprise Solutions
PDF.co platform runs on secure and certified cloud infrastructure. For enterprise customers, there is a Dedicated API Server available in any region of your choice.
Extract PDF to XLS
This API will convert your PDF file and scanned images to a spreadsheet with layout and fonts preserved. You just need to provide the URL as input to the API and the rest of the things will be taken care of by API. You can pass a link to a file from Google Drive, Dropbox, or another online file service that can generate shareable links.
Now let me walk you through the PDF.co API endpoint documentation.
Endpoint
URL: | https://api.pdf.co/v1/pdf/convert/to/xls |
Method: | POST |
Parameter | Description |
Url | points to the source file to be converted. |
async | optional. Set to true to run as an async job in the background (recommended for heavy documents). |
name | optional. The filename for the generated output. Must be a String. |
pages | optional. Comma-separated list of page indices (or ranges) to process. |
rect | optional. Defines coordinates for extraction, e.g. 51.8, 114.8, 235.5, 204.0. |
lang | optional. Sets OCR language to be used for scanned PDF, PNG, JPG documents when extracting text from them. Default is eng. |
Inline | optional. Returns the link of the output file (default). |
encrypt | optional. Enable encryption for the output file. |
PDF to XLS Demo
PDF to XLS cURL Code Snippet
curl --location --request POST 'https://api.pdf.co/v1/pdf/convert/to/xls' \ --header 'x-api-key: {{x-api-key}}' \ --header 'Content-Type: application/json' \ --data-raw '{ "url": "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-to-excel/sample.pdf" }'
Extract PDF to XLSX
This API will convert your PDF file to a spreadsheet with layout and fonts preserved. You just need to provide the URL as input to the API and the rest of the things will be taken care of by API.
Now let me walk you through the PDF.co API endpoint documentation.
Endpoint
URL: | https://api.pdf.co/v1/pdf/convert/to/xlsx |
Method: | POST |
Parameter | Description |
Url | points to the source file to be converted. |
Password | optional. Specify the password if your PDF is password-protected. |
name | optional. The filename for the generated output. Must be a String. |
PDF to XLSX Demo
PDF to XLSX cURL Code Snippet
curl --location --request POST 'https://api.pdf.co/v1/pdf/convert/to/xlsx?=' \ --header 'x-api-key: {{xi-api-key}}' \ --header 'Content-Type: application/json' \ --data-raw '{ "url":"https://bytescout-com.s3.amazonaws.com/files/demo-files/cloud-api/pdf-to-excel/sample.pdf", "name": "result.xlsx" }'
Extract PDF to XML
This API will convert your PDF file to XML with information about text value, tables, fonts, images, and object positions.
Now let me walk you through the PDF.co API endpoint documentation.
Endpoint
URL: | https://api.pdf.co/v1/pdf/convert/to/xml |
Method: | POST |
Parameter | Description |
Url | points to the source file to be converted. |
inline | “true”: To return data as inline or “false” to return a link to the output file. |
PDF To XML Demo
PDF to XML cURL Code Snippet
curl --location --request POST 'https://api.pdf.co/v1/pdf/convert/to/xml' \ --header 'x-api-key: {{x-api-key}}' \ --header 'Content-Type: application/json' \ --data-raw '{ "url": "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-to-xml/sample.pdf" }'
Extract PDF to JSON
This API will convert your PDF file into JSON representation with text, fonts, images, vectors, and formatting preserved. We have created one basic course on JSON file format which you can see on our YouTube Channel.
Now let me walk you through the PDF.co API endpoint documentation.
Endpoint
URL: | https://api.pdf.co/v1/pdf/convert/to/json2 |
Method: | POST |
Parameter | Description |
Url | points to the source file to be converted. |
inline | optional. To return data as inline or “false” to return link to the output file. |
PDF To JSON Demo
PDF to JSON cURL Code Snippet
curl --location --request POST 'https://api.pdf.co/v1/pdf/convert/to/json2' \ --header 'Content-Type: application/json' \ --header 'x-api-key: {{x-api-key}}' \ --data-raw '{ "url": "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-to-json/sample.pdf", "inline": true }'
Extract PDF to Text
This API will convert your PDF file to Text with layout preserved.
Now let me walk you through the PDF.co API endpoint documentation.
Endpoint
URL: | https://api.pdf.co/v1/pdf/convert/to/text |
Method: | POST |
Parameter | Description |
Url | points to the source file to be converted. |
inline | optional. To return data as inline or “false” to return link to the output file. |
PDF To TEXT Demo
PDF to TEXT cURL Code Snippet
curl --location --request POST 'https://api.pdf.co/v1/pdf/convert/to/text' \ --header 'Content-Type: application/json' \ --header 'x-api-key: {{x-api-key}}' \ --data-raw '{ "url": "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-to-text/sample.pdf" }'
Extract PDF to CSV
This API will convert your PDF file into CSV representation with layout, columns, rows, and tables. CSV files contain comma-separated values which are usually differentiated by comma. We have created one basic course in this CSV file format which you can see on our YouTube Channel. Now it’s time to see this API in action.
Now before we go ahead, let me walk you through the PDF.co API endpoint documentation which we are going to use in our demo application.
Endpoint
URL: | https://api.pdf.co/v1/pdf/convert/to/csv |
Method: | POST |
Parameter | Description |
Url | points to the source file to be converted. |
name | The filename for the generated output. Must be a String. |
PDF To CSV Demo
PDF to CSV cURL Code Snippet
curl --location --request POST 'https://api.pdf.co/v1/pdf/convert/to/csv' \ --header 'Content-Type: application/json' \ --header 'x-api-key: {{x-api-key}}' \ --data-raw '{ "url": "https://bytescout-com.s3-us-west-2.amazonaws.com/files/demo-files/cloud-api/pdf-to-csv/sample.pdf", "lang": "eng", "inline": "true", "unwrap": "", "pages": "0-", "rect": "", "async": "false", "encrypt": "false", "name": "result.csv", "password": "", "lineGrouping": "", "profiles": "" }'
PDF.co API sample source code is available in many programming languages such as Javascript, Python, PHP, Java, C#, Visual Basic, ASP.NET, Powershell, CLI, etc. You can explore our Hundreds of Source Code Sample Apps on Github. Stay tuned and stay updated with us to get more of these.