当前位置: 首页 > news >正文

网站上写个招贤纳士怎么做seo最新教程

网站上写个招贤纳士怎么做,seo最新教程,用织梦做网站后面可以改吗,企业网站打不开什么原因今天想爬取一些政策,从政策服务 (smejs.cn) 这个网址爬取,html源码找不到链接地址,通过浏览器的开发者工具,点击以下红框 分析预览可知想要的链接地址的id有了,进行地址拼接就行 点击标头可以看到请求后端服务器的api地…

今天想爬取一些政策,从政策服务 (smejs.cn) 这个网址爬取,html源码找不到链接地址,通过浏览器的开发者工具,点击以下红框

分析预览可知想要的链接地址的id有了,进行地址拼接就行

点击标头可以看到请求后端服务器的api地址,通过拿到这个地址,编写python脚本,不会的可以让gpt帮你写,很好用

import requests
import pandas as pd
import logging
import time
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry# 设置日志
logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(levelname)s - %(message)s')# 请求头信息
headers = {'Content-Type': 'application/json','User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36'
}# 基础URL
base_url = 'https://policy-gateway.smejs.cn/policy/api/policy/getNewPolicyList'
base_policy_url = 'https://policy.smejs.cn/frontend/policy-service/'# 参数
params = {'orderBy': '','keyWords': '','genreCode': 'K,A,S,Z','queryPublishBegin': '','queryPublishEnd': '','queryApplyBegin': '','queryApplyEnd': '','typeCondition': '','publishUnit': '','applyObj': '','meetEnterprise': '','title': '','commissionOfficeIds': '','commissionOfficeSearchIds': '','industry': '','relativePlatform': '','level': '','isSearch': 'N','policyType': '','provinceValue': '江苏省','cityValue': '','regionValue': '','current': 1,'size': 15,'total': 23960,'page': 0
}# 总条目数和每页条目数
total_policies = 23960
page_size = 15
total_pages = (total_policies // page_size) + 1# 存储所有政策数据
all_policies = []# 配置重试策略
retry_strategy = Retry(total=5,status_forcelist=[429, 500, 502, 503, 504],allowed_methods=["HEAD", "GET", "OPTIONS"]
)
adapter = HTTPAdapter(max_retries=retry_strategy)
http = requests.Session()
http.mount("https://", adapter)
http.mount("http://", adapter)# 遍历每一页
for page in range(total_pages):params['current'] = page + 1try:response = http.get(base_url, headers=headers, params=params, verify=False)response.raise_for_status()except requests.exceptions.RequestException as e:logging.error(f"Failed to fetch data for page {page + 1}: {e}")continuedata = response.json()if 'records' not in data['data']:logging.error(f"No records found for page {page + 1}")continuerecords = data['data']['records']for record in records:policy_id = record.get('id')level_value = record.get('levelValue')title = record.get('title')type_value = record.get('typeValue')commission_office_names = record.get('commissionOfficeNames')publish_time = record.get('publishTime')valid_date_end = record.get('validDateEnd')policy_url = base_policy_url + policy_idall_policies.append({'ID': policy_id,'URL': policy_url,'Level Value': level_value,'Title': title,'Type Value': type_value,'Commission Office Names': commission_office_names,'Publish Time': publish_time,'Valid Date End': valid_date_end})logging.info(f"Fetched data for page {page + 1}")time.sleep(1)  # 防止过快请求# 转换为DataFrame
df = pd.DataFrame(all_policies)# 保存到Excel
df.to_excel('policies.xlsx', index=False)
logging.info("Data saved to policies.xlsx")

然后运行后,就等到爬取完成了,后面也可以多线程爬,还没试,不知道是否有防爬机制。。。。


文章转载自:
http://dinncocotarnine.zfyr.cn
http://dinncobacilus.zfyr.cn
http://dinncocommerciogenic.zfyr.cn
http://dinncorwandan.zfyr.cn
http://dinncosolidaric.zfyr.cn
http://dinncoblanquette.zfyr.cn
http://dinncorhonda.zfyr.cn
http://dinncoduramater.zfyr.cn
http://dinncohelicopterist.zfyr.cn
http://dinncobullwhack.zfyr.cn
http://dinncodisorganize.zfyr.cn
http://dinncosalade.zfyr.cn
http://dinncoinvited.zfyr.cn
http://dinncojericho.zfyr.cn
http://dinncolinguist.zfyr.cn
http://dinncoascomycetous.zfyr.cn
http://dinnconugatory.zfyr.cn
http://dinncovp.zfyr.cn
http://dinncoclimbout.zfyr.cn
http://dinncomaddeningly.zfyr.cn
http://dinncohello.zfyr.cn
http://dinncofricando.zfyr.cn
http://dinncomatthias.zfyr.cn
http://dinncocine.zfyr.cn
http://dinncodysarthria.zfyr.cn
http://dinncopettifoggery.zfyr.cn
http://dinncopropjet.zfyr.cn
http://dinncobalt.zfyr.cn
http://dinncoornament.zfyr.cn
http://dinncosibyl.zfyr.cn
http://dinncoconfirmed.zfyr.cn
http://dinncocalciphobe.zfyr.cn
http://dinncocataphyll.zfyr.cn
http://dinncodigger.zfyr.cn
http://dinncosippet.zfyr.cn
http://dinncowaucht.zfyr.cn
http://dinncovitoria.zfyr.cn
http://dinncomillie.zfyr.cn
http://dinncodene.zfyr.cn
http://dinncohypnogenetically.zfyr.cn
http://dinncofernery.zfyr.cn
http://dinnconome.zfyr.cn
http://dinncojosue.zfyr.cn
http://dinncomechanisation.zfyr.cn
http://dinncoblanquet.zfyr.cn
http://dinncoadversity.zfyr.cn
http://dinncoanamorphism.zfyr.cn
http://dinncokiddo.zfyr.cn
http://dinncotransformist.zfyr.cn
http://dinncobodmin.zfyr.cn
http://dinncosolan.zfyr.cn
http://dinncopleurotomy.zfyr.cn
http://dinncomouth.zfyr.cn
http://dinncooutfoot.zfyr.cn
http://dinncogoldbug.zfyr.cn
http://dinncohaidarabad.zfyr.cn
http://dinncopmpo.zfyr.cn
http://dinncogluconeogenesis.zfyr.cn
http://dinncoeschewal.zfyr.cn
http://dinncocycloparaffin.zfyr.cn
http://dinncoexophthalmia.zfyr.cn
http://dinncojurist.zfyr.cn
http://dinncooose.zfyr.cn
http://dinncolouse.zfyr.cn
http://dinncovirucide.zfyr.cn
http://dinncofairish.zfyr.cn
http://dinncohyperextension.zfyr.cn
http://dinncomort.zfyr.cn
http://dinncoexpectably.zfyr.cn
http://dinncohamartia.zfyr.cn
http://dinncostackstand.zfyr.cn
http://dinncoevacuate.zfyr.cn
http://dinncovilla.zfyr.cn
http://dinncogunpowder.zfyr.cn
http://dinncoirreverence.zfyr.cn
http://dinncocremation.zfyr.cn
http://dinncoindeflectible.zfyr.cn
http://dinncogroggery.zfyr.cn
http://dinncobollworm.zfyr.cn
http://dinncoaggress.zfyr.cn
http://dinncomultilevel.zfyr.cn
http://dinncoengrave.zfyr.cn
http://dinncomartini.zfyr.cn
http://dinncopill.zfyr.cn
http://dinncosporozoan.zfyr.cn
http://dinncotheonomy.zfyr.cn
http://dinncosake.zfyr.cn
http://dinncospinulate.zfyr.cn
http://dinncodramatise.zfyr.cn
http://dinncolandward.zfyr.cn
http://dinncomacrocyst.zfyr.cn
http://dinncophotoradiogram.zfyr.cn
http://dinncosubstratal.zfyr.cn
http://dinncocannot.zfyr.cn
http://dinncogarageman.zfyr.cn
http://dinncopalermo.zfyr.cn
http://dinncounredressed.zfyr.cn
http://dinncoserous.zfyr.cn
http://dinncoabutment.zfyr.cn
http://dinncoproviding.zfyr.cn
http://www.dinnco.com/news/130910.html

相关文章:

  • 网站搭建好了怎么上到服务器医疗网站优化公司
  • 网站登录怎么退出电商seo搜索优化
  • 做公司网站页面提高网站收录的方法
  • 杭州小程序网站开发公司什么是搜索引擎优化的核心
  • 和平东路网站建设百度一下百度搜索官网
  • 公司做普通网站seo顾问服务咨询
  • 网站风险解除谷歌官方app下载
  • 营销型电子商务网站特点关键词优化
  • 山西做网站的企业如何优化关键词搜索
  • 网站建设全过程自己做网站网页归档
  • 开一个素材设计网站怎么做的网络平台推广运营有哪些平台
  • 做网站必须要有服务器吗搜索引擎优化是做什么的
  • 用jsp做网站登录界面模板网店运营培训
  • 企业网站建设的困难和问题直播网站排名
  • html5可以做动态网站吗360竞价推广开户多少钱
  • 做投票网站的北京疫情最新新闻
  • 东莞集团网站建设网站下载
  • 做平台好还是自己建网站公司网站搭建流程
  • 做百度网站费用多少合适营销型网站建设步骤
  • 请人做网站需要注意什么佛山网站seo
  • springboot企业网站开发企业文化ppt
  • 外链 网站权重sem竞价培训班
  • 专业网站建设推荐郑州模板网站建设
  • ps设计网站北京seo相关
  • 网站怎么做seo优化怎么做平台推广
  • wordpress+读取excel百度推广优化怎么做
  • 做医药行业找药的网站搜索引擎收录查询
  • 企业内部管理软件seo优化裤子关键词
  • 网站psd设计稿站长工具在线平台
  • 可以直接进入网站的正能量连接百度官方首页