柚子快報激活碼778899分享:自己寫的爬蟲小案例
柚子快報激活碼778899分享:自己寫的爬蟲小案例
網(wǎng)址:aHR0cDovL2pzc2NqZ3B0Lmp4d3JkLmdvdi5jbi8/dXJsPS92aWV3L3dvcmtpbmdVbml0L3dvcmtpbmdVbml0Lmh0bWw=
這串代碼能夠爬取勘察單位企業(yè)的詳細(xì)信息。?
import requests
import time
import csv
f = open('勘察單位公司信息.csv','w',encoding='utf-8',newline='')
csv_writer = csv.DictWriter(f,fieldnames=[
'公司名稱',
'涉及領(lǐng)域',
'所屬省市',
'注冊資本(萬元)',
'成立日期',
'發(fā)證機(jī)關(guān)',
'統(tǒng)一社會信用碼',
'經(jīng)營范圍',
'單位網(wǎng)址',
'法人',
'主營業(yè)務(wù)',
'注冊地址',
'證書網(wǎng)址',
])
csv_writer.writeheader()
cookies = {
'JSESSIONID': 'BD8F06872E4D854606108A0F4AF55D79',
}
def get_response(id):
info_url = f'http://jsscjgpt.jxwrd.gov.cn/api/employer/findByEmployer?unitId={id}'
result = requests.get(url=info_url,headers=headers).json()
return result
headers = {
'Accept': 'application/json, text/javascript, */*; q=0.01',
'Accept-Language': 'zh-CN,zh;q=0.9',
'Cache-Control': 'no-cache',
'Connection': 'keep-alive',
'Content-Type': 'application/json',
# 'Cookie': 'JSESSIONID=BD8F06872E4D854606108A0F4AF55D79',
'Origin': 'http://jsscjgpt.jxwrd.gov.cn',
'Pragma': 'no-cache',
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36',
'X-Requested-With': 'XMLHttpRequest',
}
for page in range(1,299):
time.sleep(1)
json_data = {
'pageNum': page,
'pageSize': 10,
'query': '1',
'unitName': '',
'unitCode': '',
'userName': '',
}
response = requests.post(
'http://jsscjgpt.jxwrd.gov.cn/api/employer/findAll',
cookies=cookies,
headers=headers,
json=json_data,
verify=False,
)
json_response = response.json()
for index in json_response['data']['list']:
id = index['companyId']
data = get_response(id)['data']
dit = {
'公司名稱': data['name'],
'涉及領(lǐng)域': data['categoryType'],
'所屬省市': data['address'],
'注冊資本(萬元)': data['registeredMoney'],
'成立日期': data['setUpYear'],
'發(fā)證機(jī)關(guān)': data['certificateCompany'],
'統(tǒng)一社會信用碼': data['code'],
'經(jīng)營范圍': data['businessRange'],
'單位網(wǎng)址': data['companyUrl'],
'法人': data['legalPerson'],
'主營業(yè)務(wù)': data['mainBusiness'],
'注冊地址': data['registeredAddress'],
'證書網(wǎng)址': data['companyCommitment'],
}
print(dit)
csv_writer.writerow(dit)
柚子快報激活碼778899分享:自己寫的爬蟲小案例
相關(guān)閱讀
本文內(nèi)容根據(jù)網(wǎng)絡(luò)資料整理,出于傳遞更多信息之目的,不代表金鑰匙跨境贊同其觀點和立場。
轉(zhuǎn)載請注明,如有侵權(quán),聯(lián)系刪除。