cnsipo fetches data from SIPO of China.
Please take the following steps(step 1-5 are essential, others are optional):
-
get all patent ids for each year(1985-2014) of each kind (kind: 1-发明公布 2-发明授权 3-实用新型 4-外观设计)
python cnsipo/patent_list.py -k{kind} {year} -i {input_dir} -o {output_dir}output:
{input_dir}/{kind}-{year}.html (cached for later use) {output_dir}/{year}/{page_index} -
merge id files(result of step 1) for each year
bin/merge.sh output_dir path_to_year_dir/{year}output:
{output_dir}/{year} -
fetch patents' details from the id files(result of step 2) of each kind (detail_kind: 1-详细信息 2-事务数据)
python cnsipo/patent_detail.py -k{kind} -K{detail_kind} {year} -i {input_dir} -o {output_dir}output:
{output_dir}/{year} -
create a table on a (Postgres) database(d: detail, t: transaction)
bin/initdb.sh -d{database} -u{db_user} -t{db_table} d|toutput:
a table in database -
import data into database
python cnsipo/patent_db.py -d{database} -u{db_user} -p{password} -t{db_table} -i {input_dir} -K{detail_kind} {year}output:
data in database -
create an auxiliary table on a (Postgres) database
bin/initdb.sh -d{database} -u{db_user} -t{db_table} aoutput:
a table in database -
collect auxiliary data into database
python cnsipo/patent_aux_db.py -d{database} -u{db_user} -p{password} -t{patent_table} -a{aux_table} {year}output:
data in database -
create a UIG(university/industry/government) table on a (Postgres) database
bin/initdb.sh -d{database} -u{db_user} -t{db_table} uoutput:
a table in database -
collect UIG data into database
python cnsipo/patent_uig_db.py -d{database} -u{db_user} -p{password} -i{uig_table} -t{patent_table} -a{aux_table} {year}output:
data in database -
print UIG nodes for each year
python cnsipo/patent_report.py -d{database} -u{db_user} -p{password} -i{uig_table} -a{aux_table} {year} -o {output_dir}output:
{output_dir}/node{year}
-
C:\Program Files\Tencent\QQ\I18N\2052\LocList.xml
Copyright 2014-2015 Hui Zheng
Released under the MIT License.