Display Documents on mobile, tablet and desktop

The world’s most flexible document viewing solution

Download Trial Request a quote

How to run JPDF2HTML5 Converter on the Amazon Cloud

This tutorial explains how to setup the IDRsolutions JPDF2HTML5 converter or the JPDFForms converter on a Cloud service with a simple example. Both products utilize the same api and will be interchangable in the web application that we will create. The only difference will be which jar file you will use. This application will let you upload a file, convert that file into a HTML page and then will provide a link so that you can see the HTML page. It will be using the JPDF2HTML5-trial.jar which you can download here or the JPDFForms trial jar which you can request from here.

If you are new to cloud, you may want to read the introductory article on the Java PDF blog ‘How to set up Amazon Cloud/AWS Elastic Beanstalk on the NetBeans IDE’ before you start.

What you will need to have done before you start:

Create a new project

First lets create a Web application. Right click in your Projects window and select New Project. Then choose the Java Web categories Web Application project. Click Next and give the project a meaningful name.

1

On the Server and Settings screen you should select your Amazon BeanStalk environment that you created in the previous tutorial. Java EE should be set to version 6. Select Finish.

jpdfServerAndSettings

You should get a project that’s structure resembles this:

3

At this point you can run your application to ensure it is set up and will correctly deploy. You can do this by right clicking the top-most node of your project and clicking Run. Optionally you can click deploy but run will open your application in your web browser for you to view after deployment.

Modify index.jsp

Index.jsp lives in the Web Pages directory, which is where files should be placed if you want clients to be able to access them via their web browsers. Index.jsp will display a HTML form on the client’s computer, which will let you select a file and upload it to a servlet. The servlet works on the server’s machine and does all of the heavy work so that the client machine can just sit back and wait for it’s file to be converted. If you have an index.html then delete this so your index.jsp is always used as the default web page upon starting your application.
 
You also need to add a form with two elements: File and Submit. Your file should be given an ID as “pdfFile”. This is important because later on the servlet will use this ID to extract the file from the form request.
 
The form will post to a servlet called PDF2HTML5converter and we will create that in the next step. 

Your index file should look like this:

Create the servet / PDF2HTML5converter

At the end of this tutorial I will post the full code for this class. Those of you who feel fairly competent with web applications can just skip down there to browse the code. Everyone else can use the code as a quick reference to make sure they are on the right track.

This should be created and stored in the Source Packages directory since it is essentially a Java class. To create this right click your Source Packages directory, go to the New menu and select the Other menu item. This will open the New File wizard. Choose from the Web category a Servlet file type. Click Next and name your servlet. Click Finish.

4

As you can see servlets have some predefined methods like doPost() and processRequest(). When the servlet receives a request via the post method it’s doPost() method is called. This method calls processRequest() and passes in the HttpServletRequest and a HttpServletResponse. We can get any data that was passed along with the HttpServletRequest e.g. a file and similarly attach data to the HttpServletResponse e.g. HTML.
 

@MultipartConfig is needed to tell the servlet to expect requests conforming to the multipart/form-data MIME type (so that requests containing a file can be retrieved).
@WebServlet(urlPatterns = {“/yourServletName”}) should already be supplied. It is used to specify that the servlet is available at the specified URL pattern. This is how your index.jsp can locate your servlet when sending the form.
 

This class has 3 jobs so we will make 3 methods and call them in the processRequest method. Your new ProcessRequest() should look like below. We pass the request into getFile() because we need the request in order to extract the file from it. Similarly response is passed into generateOutput() because that method that will carry out the download, which will be put into the response for the client.

Copy PDF file to local file on server / getFile()

This is probably the trickiest method of the three. This is because we are going to copy the file to the server and different servers have different file systems. In this example we are using a TomCat server on Amazon Beanstalk.

Convert the data into a HTML page / convertPdf2Html5()

Before we can start this method we need to add the jar to the project library. If we don’t then we can’t use the functions and classes we need to convert the file into a HTML file. To add the jar, right click your project’s top most node and select Properties. Look down the category tree on the left and select Libraries. Then select the Add JAR/Folder button and navigate to your Jpdf2html5 jar OR your JPDFForms jar. Click Open and then OK.

5

This is the easiest method. We can use the example code from here to help us. Since we aren’t using any options we can just create the HTMLConversionOptions() and IDRViewerOptions() as new objects. The Converter’s convert method takes our byte[] and output file as parameters.

Display link to output / generateOutput()

Create the response that will be sent back to the client’s browser with a link to our newly generated output.

Deploy and run application

Now that all of your methods are complete it is time to test your application. Right click your project’s top-most node and click Run. You should be greeted with a HTML page like this:

6

Choose your file and click the convert button. Please note that large PDF sizes may cause the server to time out. If this happens you will receive a Proxy Error with the reason: Error reading from remote server. The page below should be displayed if there are no errors.

7

Once you click that link you should see something similar to below but with your converted PDF.

8

Complete PDF2HTML5converter code

IDRSolutions Limited 1999-2016