Java pdf转word实例

来源:互联网 发布:手机淘宝滚动图 编辑:程序博客网 时间:2024/06/06 00:06

写了一个简单的转换,经测试是可以的,不过发现这个转换有大小的限制,小文件是转换成功的

private static void pdtToWord() {File file = new File("C:/Users/Administrator/Desktop/txt-book/Jpa/JPA教程.pdf");PDDocument doc = null;try {doc = PDDocument.load(file);} catch (IOException e) {e.printStackTrace();}int pagenumber=doc.getNumberOfPages();//获取总页数FileOutputStream fos = null;try {fos = new FileOutputStream("C:/Users/Administrator/Desktop/txt-book/Jpa/JPA教程.doc");} catch (FileNotFoundException e) {e.printStackTrace();}Writer writer = null;try {writer = new OutputStreamWriter(fos,"UTF-8");} catch (UnsupportedEncodingException e) {e.printStackTrace();}PDFTextStripper stripper = null;try {stripper = new PDFTextStripper();} catch (IOException e) {e.printStackTrace();}stripper.setSortByPosition(true);//排序stripper.setStartPage(1);//设置转换的开始页stripper.setEndPage(pagenumber);//设置转换的结束页try {stripper.writeText(doc,writer);} catch (IOException e) {e.printStackTrace();}try {writer.close();doc.close();} catch (IOException e) {e.printStackTrace();}}

使用maven导入jar包

<dependency>   <groupId>org.apache.pdfbox</groupId>   <artifactId>pdfbox</artifactId>    <version>2.0.0-RC3</version></dependency>


原创粉丝点击