qidao123.com Tech Community - IT Enterprise Service Reviews · App Marketplace
Title:
The requests library
Author:
欢乐狗
Posted:
6 days ago
Simulating a request to fetch a page's source code
The code is as follows:
import requests
r = requests.get('https://spa1.scrape.center/page/1')
print(r.text)
Running this prints the page's HTML source.
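In practice a request can fail or hang, so a slightly more defensive version may be useful. The sketch below is not from the original post: the helper name fetch_html and the 10-second timeout are illustrative assumptions. It relies on the timeout argument of requests.get and on raise_for_status(), which turns 4xx/5xx responses into exceptions.

```python
import requests


def fetch_html(url, timeout=10):
    """Return the page source as text, or None if the request fails.

    fetch_html and the default timeout are illustrative choices,
    not part of the original post.
    """
    try:
        r = requests.get(url, timeout=timeout)
        # Raise requests.HTTPError for 4xx/5xx status codes.
        r.raise_for_status()
    except requests.RequestException as exc:
        # RequestException is the base class for connection errors,
        # timeouts, invalid URLs and the HTTP errors raised above.
        print(f"request failed: {exc}")
        return None
    return r.text


if __name__ == "__main__":
    html = fetch_html("https://spa1.scrape.center/page/1")
    if html is not None:
        print(html[:200])  # show only the first 200 characters
```

Returning None on failure keeps the calling code simple; a real crawler might prefer to log the error and retry instead.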
GET requests
The example site is https://httpbin.org
import requests
r = requests.get('https://httpbin.org/get')
print(r.text)
The output is as follows:
{
  "args": {},
  "headers": {
    "Accept": "*/*",
    "Accept-Encoding": "gzip, deflate",
    "Host": "httpbin.org",
    "User-Agent": "python-requests/2.32.3",
    "X-Amzn-Trace-Id": "Root=1-681dec13-1f8d5f796c04c6d73f09a6b3"
  },
  "origin": "183.250.93.64",
  "url": "https://httpbin.org/get"
}
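To add two query parameters, where name is germey and age is 25,
the URL can be written as https://httpbin.org/get?name=germey&age=25. Instead of building that string by hand, pass a dict through the params argument: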
import requests
# Query parameters that requests appends to the URL
data = {
    'name': 'germey',
    'age': 25
}
r = requests.get('https://httpbin.org/get', params=data)
print(r.text)
The result:
{
  "args": {
    "age": "25",
    "name": "germey"
  },
  "headers": {
    "Accept": "*/*",
    "Accept-Encoding": "gzip, deflate",
    "Host": "httpbin.org",
    "User-Agent": "python-requests/2.32.3",
    "X-Amzn-Trace-Id": "Root=1-681ded56-2771b27b0f18cd9b71461ee7"
  },
  "origin": "183.250.93.64",
  "url": "https://httpbin.org/get?name=germey&age=25"
}
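The "url" field in the output above shows that requests built the query string from the dict. One way to inspect that encoding without sending anything is to prepare the request first. This is a minimal sketch, not part of the original post; the variable names and the second parameter value ("ger mey", with a space) are made up for illustration.

```python
import requests

data = {"name": "germey", "age": 25}

# Build the request without sending it, so no network access is needed.
prepared_plain = requests.Request(
    "GET", "https://httpbin.org/get", params=data
).prepare()
print(prepared_plain.url)  # https://httpbin.org/get?name=germey&age=25

# Special characters such as spaces are escaped automatically.
# "ger mey" is a made-up value used only to show the escaping.
prepared_escaped = requests.Request(
    "GET", "https://httpbin.org/get", params={"name": "ger mey"}
).prepare()
print(prepared_escaped.url)
```

Letting the library do the encoding avoids hand-written mistakes in the query string, such as a stray "/" after the "?".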
The response body is a str, even though its content is in JSON format. To parse it directly into a Python dict, call the json() method:
import requests
r = requests.get('https://httpbin.org/get')
print(type(r.text))
print(r.json())
print(type(r.json()))
The result:
<class 'str'>
{'args': {}, 'headers': {'Accept': '*/*', 'Accept-Encoding': 'gzip, deflate', 'Host': 'httpbin.org', 'User-Agent': 'python-requests/2.32.3', 'X-Amzn-Trace-Id': 'Root=1-681dee2d-03cd762c0490679f0ef17d91'}, 'origin': '183.250.93.64', 'url': 'https://httpbin.org/get'}
<class 'dict'>
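The difference between r.text and r.json() can be reproduced offline with the standard json module, which does roughly what json() does with the body. The sketch below is an illustration rather than the author's code: the body string is shortened from the httpbin output above, and parse_json_or_none is a hypothetical helper. It also shows the failure case: a body that is not valid JSON raises an exception that is a subclass of ValueError.

```python
import json

# Sample body, shortened from the httpbin output shown earlier.
body = '{"args": {}, "origin": "183.250.93.64", "url": "https://httpbin.org/get"}'

parsed = json.loads(body)  # roughly what r.json() does with the body
print(type(body))     # <class 'str'>
print(type(parsed))   # <class 'dict'>
print(parsed["url"])  # https://httpbin.org/get


def parse_json_or_none(text):
    """Hypothetical helper: return the parsed JSON, or None if text is not JSON."""
    try:
        return json.loads(text)
    except ValueError:
        # json.JSONDecodeError is a subclass of ValueError.
        return None


print(parse_json_or_none("<html>not json</html>"))  # None
```

With a real response, the same try/except ValueError pattern can wrap the call to r.json().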
Fetching a web page
import requests
import re
r = requests.get('https://ssr1.scrape.center/')
# re.S lets "." match newlines, so titles that span several lines are still captured
pattern = re.compile(r'<h2.*?>(.*?)</h2>', re.S)
titles = pattern.findall(r.text)
print(titles)
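To see what the regular expression does without hitting the site, it can be run against a small HTML string. The snippet below is a sketch under stated assumptions: the markup and the titles "Movie A" and "Movie B" are invented stand-ins for r.text, and the real page structure may differ.

```python
import re

# Hypothetical HTML standing in for r.text; the real page markup may differ.
html = """
<div class="item">
  <h2 class="m-b-sm">Movie A</h2>
  <h2 class="m-b-sm">Movie
  B</h2>
</div>
"""

pattern = re.compile(r'<h2.*?>(.*?)</h2>', re.S)
titles = pattern.findall(html)

# Collapse internal whitespace, since a captured title may span lines.
clean_titles = [" ".join(t.split()) for t in titles]
print(clean_titles)  # ['Movie A', 'Movie B']

# Without re.S, "." does not match newlines, so the second title is missed.
titles_no_dotall = re.findall(r'<h2.*?>(.*?)</h2>', html)
print(titles_no_dotall)  # ['Movie A']
```

Regular expressions are fine for a quick extraction like this; for anything more involved, a dedicated HTML parser is usually more robust.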