
NJNUInformationExtraction/InformationExtraction

InformationExtraction Usage Guide

#### Note 1

The main extraction method of every extractor type is inherited from the following method of the InfoExtract class:

public abstract Extractable extractInformation ( String html );

#### Note 2

public abstract Extractable extractInformation ( String html ):

The parameter is an HTML string (the method may optionally clean it internally);  
The return value is of type Extractable (described below).
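
To make the contract concrete, here is a minimal sketch of what a concrete extractor might look like. Only the `extractInformation` signature comes from this README. `TitleExtract`, the regex-based logic, and the simplified `InfoExtract` and `Extractable` stand-ins (which store entries as `Map.Entry` rather than the repository's `Pair`) are assumptions made so the example compiles on its own.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical demo entry point; not part of the repository.
class TitleExtractDemo {
    public static void main(String[] args) {
        InfoExtract extractor = new TitleExtract();
        Extractable result = extractor.extractInformation(
                "<html><head><title> Hello </title></head></html>");
        System.out.println(result);
    }
}

// Simplified stand-in for the repository's Extractable (assumption).
class Extractable {
    protected ArrayList<Map.Entry<String, String>> data = new ArrayList<>();

    public void put(String key, String value) {
        data.add(new AbstractMap.SimpleEntry<>(key, value));
    }

    @Override
    public String toString() {
        return data.toString();
    }
}

// Stand-in for the repository's InfoExtract; only this signature is from the README.
abstract class InfoExtract {
    public abstract Extractable extractInformation(String html);
}

// Hypothetical extractor that pulls the <title> text out of an HTML string.
class TitleExtract extends InfoExtract {
    private static final Pattern TITLE = Pattern.compile(
            "<title[^>]*>(.*?)</title>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    @Override
    public Extractable extractInformation(String html) {
        Extractable result = new Extractable();
        Matcher m = TITLE.matcher(html);
        if (m.find()) {
            // Trim surrounding whitespace as a minimal "cleaning" step.
            result.put("title", m.group(1).trim());
        }
        return result;
    }
}
```

A real extractor would likely use an HTML parser rather than a regular expression; the regex here only keeps the sketch dependency-free.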

#### Note 3

> The only *field* in the Extractable class is:  
```java
protected ArrayList<Pair<String, String>> data = new ArrayList<>();
```

There are two ways to add a Pair to an Extractable instance:

```java
// Method 1: pass the key and value for the pair; put builds the Pair itself
public void put(String key, String value);
// Method 2: pass an existing Pair directly
public void put(Pair<String, String> pair);
```
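
The sketch below shows how the two `put` overloads could relate to the `data` field. The field declaration and the two `put` signatures come from this README. The `Pair` class, the method bodies, and the `size`/`get` accessors are hypothetical, since the repository's own `Pair` type and implementation are not shown here.

```java
import java.util.ArrayList;

// Hypothetical demo entry point; not part of the repository.
class ExtractableDemo {
    public static void main(String[] args) {
        Extractable record = new Extractable();
        record.put("title", "Example page");             // method 1: key and value
        record.put(new Pair<>("author", "Unknown"));     // method 2: ready-made Pair
        for (int i = 0; i < record.size(); i++) {
            Pair<String, String> p = record.get(i);
            System.out.println(p.getKey() + " -> " + p.getValue());
        }
    }
}

// Hypothetical minimal Pair; a stand-in for whichever Pair class the repository uses.
final class Pair<K, V> {
    private final K key;
    private final V value;

    Pair(K key, V value) {
        this.key = key;
        this.value = value;
    }

    K getKey() { return key; }
    V getValue() { return value; }
}

// Stand-in for Extractable: field and put signatures from the README, bodies assumed.
class Extractable {
    protected ArrayList<Pair<String, String>> data = new ArrayList<>();

    // Method 1: build the Pair from key and value, then store it.
    public void put(String key, String value) {
        put(new Pair<>(key, value));
    }

    // Method 2: store an existing Pair as-is.
    public void put(Pair<String, String> pair) {
        data.add(pair);
    }

    // Hypothetical accessors, added only so the demo can read entries back.
    public int size() { return data.size(); }
    public Pair<String, String> get(int index) { return data.get(index); }
}
```

Because `data` is a list rather than a map, this sketch keeps insertion order and allows the same key to appear more than once.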
