
Problem description:

The raw data is stored in a .txt file in the following format:

How can content stored in the format above be converted into the format below, so that DiaCOQE can process it more easily?
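The two format examples are not reproduced in this text. Judging only from the parsing logic used in this post, each label line appears to hold five bracketed fields separated by `;` (subject, object, aspect, opinion, polarity), with every token written as `index&word`. The following is a minimal sketch of that per-label conversion under those assumptions. The helper names `span_text` and `parse_label` and the sample line are hypothetical and are not taken from the dataset.

```python
# Polarity codes as mapped in this post's code.
POLARITY = {'1': 'Better', '0': 'Equal', '-1': 'Worse', '2': 'Different'}

def span_text(field, sep='&'):
    """Turn a field such as '[1&A 2&B]' into 'AB'; an empty '[]' gives ''."""
    tokens = field.strip()[1:-1].split()
    # Keep only the word part of each 'index<sep>word' token.
    return ''.join(tok.split(sep, 1)[1] for tok in tokens if sep in tok)

def parse_label(label_line, sep='&'):
    """Parse one label line into (subject, object, aspect, opinion, polarity)."""
    sub, obj, asp, opi, pol = label_line.strip()[1:-1].split(';')
    polarity = POLARITY.get(pol.strip()[1:-1], '')  # '[]' maps to ''
    return (span_text(sub, sep), span_text(obj, sep),
            span_text(asp, sep), span_text(opi, sep), polarity)

# Hypothetical label line; the indices and words are made up for illustration.
sample = "[[1&A 2&B];[5&C];[7&D 8&E];[9&F];[1]]"
print(parse_label(sample))
```

Words are concatenated without a separator, which mirrors the `''.join(...)` used in the post and suits character-level Chinese tokens. For space-delimited text a `' '.join(...)` would probably be preferable.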

Solution:

from pdb import set_trace as stop
import os
import re
from tqdm import trange

generated_path = "/home/qtxu/DiaCOQE/data/Ele-COQE/dev.txt"
Unicoqe_path = "/home/qtxu/DiaCOQE/data/Ele-COQE/devss.txt"

# Read the whole input file into memory, one entry per line.
raw_data = []
with open(os.path.join(generated_path), 'r') as f:
    for line in f:
        raw_data.append(line)

polarity_dict = {'1': 'Better', '0': 'Equal', '-1': 'Worse', '2': 'Different'}


def obtain_span_str(cur_str, cur_path):
    # The Camera data separates index and word with '&&'; the other sets use '&'.
    if 'Camera' in cur_path:
        split_sign = '&&'
    else:
        split_sign = '&'
    cur_list = cur_str.strip()[1:-1].split()
    # Drop the index of every 'index&word' pair and concatenate the words.
    cur_span = ''.join(word for index, word in (pair.split(split_sign) if '&' in pair else [pair, ''] for pair in cur_list))
    return cur_span


with open(Unicoqe_path, 'w') as fw:
    text_line, label_line = '', ''
    for line_id in trange(len(raw_data), desc="processing data"):
        cur_line = raw_data[line_id]
        if len(cur_line.split('\t')) != 2:
            # Not a "sentence<TAB>label" line, so it is a label line of the current sentence.
            label_line += '\n' + cur_line
        else:
            # A new sentence starts: write out the previous sentence and its labels first.
            if text_line != '':
                sent, label = text_line.strip().split("\t")
                fw.write(sent + "####")
                label_list = label_line.strip().split('\n\n')
                span_words = []
                for label_i in label_list:
                    cur_span = label_i.strip()[1:-1].split(';')
                    sub, obj, asp, opi, polarity = cur_span[0], cur_span[1], cur_span[2], cur_span[3], cur_span[4]
                    try:
                        sub_span = obtain_span_str(sub, generated_path)
                        obj_span = obtain_span_str(obj, generated_path)
                        asp_span = obtain_span_str(asp, generated_path)
                        opi_span = obtain_span_str(opi, generated_path)
                        # An empty polarity field '[]' has length 2.
                        if len(polarity) == 2:
                            por_span = ''
                        else:
                            por_span = polarity_dict[polarity[1:-1]]
                    except Exception:
                        # Show the offending sentence and drop into the debugger.
                        print(text_line)
                        stop()
                    span_quintuple = (sub_span, obj_span, asp_span, opi_span, por_span)
                    span_words.append(span_quintuple)
                fw.write(str('[' + ', '.join(str(span) for span in span_words)) + "]\n")
            text_line = cur_line
            label_line = ''

    # Write out the last sentence, which no following sentence line triggers.
    fw.write(text_line.strip().split("\t")[0] + "####")
    label_list = label_line.strip().split('\n\n')
    span_words = []
    for label_i in label_list:
        cur_span = label_i.strip()[1:-1].split(';')
        sub, obj, asp, opi, polarity = cur_span[0], cur_span[1], cur_span[2], cur_span[3], cur_span[4]
        sub_span = obtain_span_str(sub, generated_path)
        obj_span = obtain_span_str(obj, generated_path)
        asp_span = obtain_span_str(asp, generated_path)
        opi_span = obtain_span_str(opi, generated_path)
        if len(polarity) == 2:
            por_span = ''
        else:
            por_span = polarity_dict[polarity[1:-1]]
        span_quintuple = (sub_span, obj_span, asp_span, opi_span, por_span)
        span_words.append(span_quintuple)
    fw.write(str('[' + ', '.join(str(span) for span in span_words)) + "]\n")

The optimized version of the code is as follows:

import os
import re
from tqdm import trange
from pdb import set_trace as stop

polarity_dict = {'1': 'Better', '0': 'Equal', '-1': 'Worse', '2': 'Different'}


def obtain_span_str(cur_str, cur_path):
    # The Camera data separates index and word with '&&'; the other sets use '&'.
    split_sign = '&&' if 'Camera' in cur_path else '&'
    cur_list = cur_str.strip()[1:-1].split()
    # Drop the index of every 'index&word' pair and concatenate the words.
    cur_span = ''.join(word for index, word in (pair.split(split_sign) if '&' in pair else [pair, ''] for pair in cur_list))
    return cur_span


def process_label_line(label_list, generated_path, text_line):
    # Convert every label line of one sentence into a five-element tuple.
    span_words = []
    for label_i in label_list:
        sub, obj, asp, opi, polarity = label_i.strip()[1:-1].split(';')
        try:
            sub_span = obtain_span_str(sub, generated_path)
            obj_span = obtain_span_str(obj, generated_path)
            asp_span = obtain_span_str(asp, generated_path)
            opi_span = obtain_span_str(opi, generated_path)
            # An empty polarity field '[]' has length 2.
            if len(polarity) == 2:
                por_span = ''
            else:
                por_span = polarity_dict[polarity[1:-1]]
        except Exception:
            # Show the offending sentence and drop into the debugger.
            print(text_line)
            stop()
        span_quintuple = (sub_span, obj_span, asp_span, opi_span, por_span)
        span_words.append(span_quintuple)
    return span_words


def process_file(read_file, write_file):
    raw_data = []
    with open(os.path.join(read_file), 'r') as f:
        for line in f:
            raw_data.append(line)

    with open(write_file, 'w') as fw:
        text_line, label_line = '', ''
        for line_id in trange(len(raw_data), desc="processing data"):
            cur_line = raw_data[line_id]
            if len(cur_line.split('\t')) != 2:
                # Not a "sentence<TAB>label" line, so it is a label line of the current sentence.
                label_line += '\n' + cur_line
            else:
                # A new sentence starts: write out the previous sentence and its labels first.
                if text_line != '':
                    sent, label = text_line.strip().split("\t")
                    fw.write(sent + "####")
                    label_list = label_line.strip().split('\n\n')
                    # Pass read_file rather than relying on a global path variable.
                    span_words = process_label_line(label_list, read_file, text_line)
                    fw.write(str('[' + ', '.join(str(span) for span in span_words)) + "]\n")
                text_line = cur_line
                label_line = ''

        # Write out the last sentence, which no following sentence line triggers.
        fw.write(text_line.strip().split("\t")[0] + "####")
        label_list = label_line.strip().split('\n\n')
        span_words = process_label_line(label_list, read_file, text_line)
        fw.write(str('[' + ', '.join(str(span) for span in span_words)) + "]\n")


if __name__ == "__main__":
    generated_path = "/home/qtxu/DiaCOQE/data/Ele-COQE/dev.txt"
    write_path = "/home/qtxu/DiaCOQE/data/Ele-COQE/devs3.txt"
    process_file(generated_path, write_path)

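The grouping step (collect every label line that follows a sentence line, then flush when the next sentence starts) can also be written as a generator, which removes the duplicated "write the last sentence" block. The sketch below is one possible variant, not the author's code. It works on an in-memory list of lines so it can be tried without the dataset, and the names `group_records` and `convert` and the sample lines are hypothetical. Unlike the code above, it skips blank lines.

```python
POLARITY = {'1': 'Better', '0': 'Equal', '-1': 'Worse', '2': 'Different'}

def span_text(field, sep='&'):
    """'[1&A 2&B]' -> 'AB'; '[]' -> ''."""
    tokens = field.strip()[1:-1].split()
    return ''.join(tok.split(sep, 1)[1] for tok in tokens if sep in tok)

def parse_label(label_line, sep='&'):
    """One label line -> (subject, object, aspect, opinion, polarity)."""
    sub, obj, asp, opi, pol = label_line.strip()[1:-1].split(';')
    return (span_text(sub, sep), span_text(obj, sep), span_text(asp, sep),
            span_text(opi, sep), POLARITY.get(pol.strip()[1:-1], ''))

def group_records(lines):
    """Yield (sentence, [label lines]) for each sentence in the raw lines."""
    sentence, labels = None, []
    for line in lines:
        if not line.strip():
            continue  # assumption: blank lines carry no information
        if len(line.split('\t')) == 2:  # sentence line: "text<TAB>flag"
            if sentence is not None:
                yield sentence, labels
            sentence, labels = line.strip().split('\t')[0], []
        else:
            labels.append(line.strip())
    if sentence is not None:
        yield sentence, labels  # flush the final sentence

def convert(lines, sep='&'):
    """Return one 'sentence####[(...), ...]' string per sentence."""
    out = []
    for sentence, labels in group_records(lines):
        quintuples = [parse_label(label, sep) for label in labels]
        out.append(sentence + "####" + str(quintuples))
    return out

# Hypothetical input lines, made up for illustration.
lines = [
    "A B is better than C\t1\n",
    "[[1&A 2&B];[6&C];[];[4&better];[1]]\n",
    "nothing to compare\t0\n",
    "[[];[];[];[];[]]\n",
]
for row in convert(lines):
    print(row)
```

Because `str()` of a list of tuples matches the `'[' + ', '.join(...) + ']'` construction used above, the output lines have the same shape as those written by `process_file`.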