应用POI,word2007转html
来源:互联网 发布:小米44g网络怎样设置 编辑:程序博客网 时间:2024/05/17 22:10
poi 3.9
http://poi.apache.org/
- import java.io.File;
- import java.io.FileInputStream;
- import java.io.FileOutputStream;
- import java.io.IOException;
- import java.io.InputStream;
- import java.io.OutputStream;
- import org.apache.poi.xwpf.converter.core.FileImageExtractor;
- import org.apache.poi.xwpf.converter.core.FileURIResolver;
- import org.apache.poi.xwpf.converter.xhtml.XHTMLConverter;
- import org.apache.poi.xwpf.converter.xhtml.XHTMLOptions;
- import org.apache.poi.xwpf.usermodel.XWPFDocument;
- import org.apache.poi.xwpf.usermodel.XWPFPictureData;
- //import org.junit.Assert;
- //import org.junit.Test;
- public class word07toHtml {
- //@Test
- public static void canExtractImage() throws IOException {
- File f = new File("d:/test/test.docx");
- if (!f.exists()) {
- System.out.println("Sorry File does not Exists!");
- } else {
- if (f.getName().endsWith(".docx") || f.getName().endsWith(".DOCX")) {
- // 1) Load DOCX into XWPFDocument
- InputStream in = new FileInputStream(f);
- XWPFDocument document = new XWPFDocument(in);
- // 2) Prepare XHTML options (here we set the IURIResolver to
- // load images from a "word/media" folder)
- File imageFolderFile = new File("d:/test/media");
- XHTMLOptions options = XHTMLOptions.create().URIResolver(
- new FileURIResolver(imageFolderFile));
- options.setExtractor(new FileImageExtractor(imageFolderFile));
- //options.setIgnoreStylesIfUnused(false);
- //options.setFragment(true);
- // 3) Convert XWPFDocument to XHTML
- OutputStream out = new FileOutputStream(new File(
- "d:/test/test.htm"));
- XHTMLConverter.getInstance().convert(document, out, options);
- } else {
- System.out.println("Enter only MS Office 2007+ files");
- }
- }
- }
- public static void main(String args[]) {
- try {
- canExtractImage();
- } catch (IOException e) {
- // TODO Auto-generated catch block
- e.printStackTrace();
- }
- }
- }
其中org.apache.poi.xwpf.converter需要扩展包
如果你的项目用到了maven做如下配置即可,若果没用maven,请从本文附件下载
1.0.4 对应的是 poi 3.9
1.0.0 对应的是 poi 3.8
import org.apache.poi.xwpf.converter.xhtml.XHTMLConverter;
import org.apache.poi.xwpf.converter.xhtml.XHTMLOptions;
所需jar包
- <dependencies>
- <dependency>
- <groupId> fr.opensagres.xdocreport</groupId>
- <artifactId> org.apache.poi.xwpf.converter.core</artifactId>
- <version> 1.0.4</version>
- </dependency>
- <dependency>
- <groupId> fr.opensagres.xdocreport</groupId>
- <artifactId> org.apache.poi.xwpf.converter.xhtml</artifactId>
- <version> 1.0.4</version>
- </dependency>
- </dependencies>
如果报错:
java.lang.ClassNotFoundException: org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTSectPrImpl$1HeaderReferenceList
请添加 ooxml-schemas-1.1.jar
java.lang.ClassNotFoundException: org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTBodyImpl$1TblList
也是需要 ooxml-schemas-1.1.jar
用maven的会自动下来,没用maven的请从本文附件下载ooxml-schemas-1.1.rar,需要解压
不过,发现转换后的table没有边框,有待解决
1 0
- 应用POI,word2007转html
- poi word转html
- java通过Apache开源框架POI读取Word2003和Word2007转换成html的demo实例
- word2007应用技巧
- poi操作word模板(word2003,word2007)
- POI动态生成word2007加强版
- POI 读取word (word2003 和 word2007)
- java poi 替换word2007中的指定文本
- poi实现Excel转html
- 读取 Excel(POI) 转 HTML
- POI 把 excel 转 html
- html转word,使用jacob和poi
- POI Word DOC格式转Html
- 利用POI进行word转html
- POI实现DOC/DOCX转HTML
- java+poi实现word转html显示
- (四) POI word 转 html 文档
- poi转excel,word文档为html
- 一张图记住常用 Linux 系统性能诊断工具
- android小功能实现之xml文件解析(Pull)
- uploadify struts2实现图片上传
- 各种浏览器UserAgent一览表(桌面+移动)
- Spring集成Quartz定时任务框架介绍和Cron表达式详解
- 应用POI,word2007转html
- 约瑟夫问题*
- iOS学习之UITableView(三):进阶篇索引,标记和自定义的table
- JAVAEE就业--HTML语言总结
- android 控件之ProgressBar实现双进度条(通常用于流媒体的缓冲区进度)
- VLC程序宏及线程分析
- 读取redis中的数据时出现:MISCONF Redis is configured to save RDB snapshots
- SQL中适用case when then来转化数据库中的信息
- hdu2955