下面是三个java例子,关于读取wps/et/dps的方法
1.读取wps(读取文本): 通过流加载wps文件,读取文字内容
import com.spire.doc.*
import java.io.File
import java.io.FileInputStream
import java.io.FileWriter
import java.io.IOException
public class ReadTextFromWPS {
public static void main(String[] args) throws IOException{
//通过流加载WPS文字文档
FileInputStream inputStream = new FileInputStream(new File("test.wps"))
Document doc = new Document()
doc.loadFromStream(inputStream, FileFormat.Doc)
//获取文本保存为String
String text = doc.getText()
//将String写入Txt
writeStringToTxt(text,"读取WPS文本.txt")
}
public static void writeStringToTxt(String content, String txtFileName) throws IOException {
FileWriter fWriter= new FileWriter(txtFileName,true)
try {
fWriter.write(content)
}catch(IOException ex){
ex.printStackTrace()
}finally{
try{
fWriter.flush()
fWriter.close()
} catch (IOException ex) {
ex.printStackTrace()
}
}
}
}
2. 读取et:直接加载et格式的表格文件,读取数据
import com.spire.xls.*
public class ExcelToText {
public static void main(String[] args) {
//加载et格式的表格文件
Workbook workbook = new Workbook()
workbook.loadFromFile("test.et")
//获取工作表
Worksheet sheet = workbook.getWorksheets().get(0)
//获取指定单元格中的文本数据
CellRange range = sheet.getCellRange("A1")
String text = range.getText().trim()
System.out.println(text)
}
}
3.读取dps:直接加载dps格式的幻灯片文档,读取文本
import com.spire.presentation.IAutoShape
import com.spire.presentation.ISlide
import com.spire.presentation.ParagraphEx
import com.spire.presentation.Presentation
import java.io.FileWriter
public class ExtractText {
public static void main(String[]args) throws Exception{
//加载测试文档
Presentation ppt = new Presentation()
//ppt.loadFromFile("test.pptx")
ppt.loadFromFile("test.dps")
StringBuilder buffer = new StringBuilder()
//遍历文档中的幻灯片,提取文本
for (Object slide : ppt.getSlides())
{
for (Object shape : ((ISlide) slide).getShapes())
{
if (shape instanceof IAutoShape)
{
for (Object tp : ((IAutoShape) shape).getTextFrame().getParagraphs())
{
buffer.append(((ParagraphEx) tp).getText())
}
}
}
}
//保存到文本文件
FileWriter writer = new FileWriter("ExtractTextfromDPS.txt")
writer.write(buffer.toString())
writer.flush()
writer.close()
}
}
这里须在Java程序中导入spire.office.jar文件。