当前位置: 首页 > news >正文

网站设计网址台州网站制作维护

网站设计网址,台州网站制作维护,ml免费域名注册,高端营销网站建设以下是一个简单的Java程序,用于爬取网站上的图片并下载到本地文件夹: import java.io.*; import java.net.*;public class ImageSpider {public static void main(String[] args) {// 确定要爬取的网站URL和本地保存目录String url "https://www.…

以下是一个简单的Java程序,用于爬取网站上的图片并下载到本地文件夹:

import java.io.*;
import java.net.*;public class ImageSpider {public static void main(String[] args) {// 确定要爬取的网站URL和本地保存目录String url = "https://www.example.com";String saveDir = "D:/images/";// 读取网页HTML源代码String html = getHtml(url);// 提取其中的图片URLString[] imgUrls = extractImgUrls(html);// 下载图片到本地for (String imgUrl : imgUrls) {downloadImage(imgUrl, saveDir);}}// 读取网页HTML源代码public static String getHtml(String url) {StringBuilder html = new StringBuilder();try {URLConnection conn = new URL(url).openConnection();BufferedReader reader = new BufferedReader(new InputStreamReader(conn.getInputStream()));String line;while ((line = reader.readLine()) != null) {html.append(line);}reader.close();} catch (IOException e) {e.printStackTrace();}return html.toString();}// 提取图片URLpublic static String[] extractImgUrls(String html) {String[] imgUrls = html.split("<img");String[] result = new String[imgUrls.length - 1];for (int i = 1; i < imgUrls.length; i++) {String imgUrl = imgUrls[i].substring(imgUrls[i].indexOf("src=") + 5);imgUrl = imgUrl.substring(0, imgUrl.indexOf("\""));result[i - 1] = imgUrl;}return result;}// 下载图片到本地public static void downloadImage(String imgUrl, String saveDir) {try {URL url = new URL(imgUrl);HttpURLConnection conn = (HttpURLConnection) url.openConnection();conn.setRequestMethod("GET");conn.setConnectTimeout(5 * 1000);InputStream inStream = conn.getInputStream();byte[] data = readInputStream(inStream);File file = new File(saveDir + imgUrl.substring(imgUrl.lastIndexOf("/") + 1));FileOutputStream outStream = new FileOutputStream(file);outStream.write(data);outStream.close();} catch (IOException e) {e.printStackTrace();}}// 读取输入流并转换为字节数组public static byte[] readInputStream(InputStream inputStream) throws IOException {ByteArrayOutputStream outputStream = new ByteArrayOutputStream();byte[] buffer = new byte[1024];int len;while ((len = inputStream.read(buffer)) != -1) {outputStream.write(buffer, 0, len);}inputStream.close();return outputStream.toByteArray();}
}

这个程序首先读取指定网址的HTML源码,然后从中提取出所有的图片URL。最后,程序利用 Java 的 IO 功能下载这些图片并保存到指定的本地文件夹中。

需要注意的是,该程序只是一个简单的演示,实际使用时需要注意合法性和合理性问题,比如设置超时时间、判断文件是否存在等。此外,爬虫在爬取网站时需要遵守网站的爬取协议和法律法规,不能侵犯他人的权益。以下是一个简单的Java程序,用于爬取网站上的图片并下载到本地文件夹:

import java.io.*;
import java.net.*;public class ImageSpider {public static void main(String[] args) {// 确定要爬取的网站URL和本地保存目录String url = "https://www.example.com";String saveDir = "D:/images/";// 读取网页HTML源代码String html = getHtml(url);// 提取其中的图片URLString[] imgUrls = extractImgUrls(html);// 下载图片到本地for (String imgUrl : imgUrls) {downloadImage(imgUrl, saveDir);}}// 读取网页HTML源代码public static String getHtml(String url) {StringBuilder html = new StringBuilder();try {URLConnection conn = new URL(url).openConnection();BufferedReader reader = new BufferedReader(new InputStreamReader(conn.getInputStream()));String line;while ((line = reader.readLine()) != null) {html.append(line);}reader.close();} catch (IOException e) {e.printStackTrace();}return html.toString();}// 提取图片URLpublic static String[] extractImgUrls(String html) {String[] imgUrls = html.split("<img");String[] result = new String[imgUrls.length - 1];for (int i = 1; i < imgUrls.length; i++) {String imgUrl = imgUrls[i].substring(imgUrls[i].indexOf("src=") + 5);imgUrl = imgUrl.substring(0, imgUrl.indexOf("\""));result[i - 1] = imgUrl;}return result;}// 下载图片到本地public static void downloadImage(String imgUrl, String saveDir) {try {URL url = new URL(imgUrl);HttpURLConnection conn = (HttpURLConnection) url.openConnection();conn.setRequestMethod("GET");conn.setConnectTimeout(5 * 1000);InputStream inStream = conn.getInputStream();byte[] data = readInputStream(inStream);File file = new File(saveDir + imgUrl.substring(imgUrl.lastIndexOf("/") + 1));FileOutputStream outStream = new FileOutputStream(file);outStream.write(data);outStream.close();} catch (IOException e) {e.printStackTrace();}}// 读取输入流并转换为字节数组public static byte[] readInputStream(InputStream inputStream) throws IOException {ByteArrayOutputStream outputStream = new ByteArrayOutputStream();byte[] buffer = new byte[1024];int len;while ((len = inputStream.read(buffer)) != -1) {outputStream.write(buffer, 0, len);}inputStream.close();return outputStream.toByteArray();}
}

这个程序首先读取指定网址的HTML源码,然后从中提取出所有的图片URL。最后,程序利用 Java 的 IO 功能下载这些图片并保存到指定的本地文件夹中。

需要注意的是,该程序只是一个简单的演示,实际使用时需要注意合法性和合理性问题,比如设置超时时间、判断文件是否存在等。此外,爬虫在爬取网站时需要遵守网站的爬取协议和法律法规,不能侵犯他人的权益。


文章转载自:
http://dinncoloadstone.bpmz.cn
http://dinncoexpectoration.bpmz.cn
http://dinncohunchy.bpmz.cn
http://dinncospanglish.bpmz.cn
http://dinncofolsom.bpmz.cn
http://dinncodipperful.bpmz.cn
http://dinncothinly.bpmz.cn
http://dinnconeonatal.bpmz.cn
http://dinncotellable.bpmz.cn
http://dinncotyrant.bpmz.cn
http://dinncoreafforest.bpmz.cn
http://dinncozygophyte.bpmz.cn
http://dinncobroncobuster.bpmz.cn
http://dinncoincongruent.bpmz.cn
http://dinncoinvitational.bpmz.cn
http://dinncoashine.bpmz.cn
http://dinncohepplewhite.bpmz.cn
http://dinncoconsent.bpmz.cn
http://dinncoungrammatical.bpmz.cn
http://dinncoairdrome.bpmz.cn
http://dinncopantelegraph.bpmz.cn
http://dinncofamine.bpmz.cn
http://dinncothinker.bpmz.cn
http://dinncocounterthrust.bpmz.cn
http://dinncobushire.bpmz.cn
http://dinncorsc.bpmz.cn
http://dinncomacrosegment.bpmz.cn
http://dinncooho.bpmz.cn
http://dinncomythopeic.bpmz.cn
http://dinncoimmunoelectrophoresis.bpmz.cn
http://dinncoon.bpmz.cn
http://dinncoreformatory.bpmz.cn
http://dinncocharacterisation.bpmz.cn
http://dinncorfz.bpmz.cn
http://dinncofashionist.bpmz.cn
http://dinncoknifepoint.bpmz.cn
http://dinncoanoopsia.bpmz.cn
http://dinncobacking.bpmz.cn
http://dinncoheteromorphic.bpmz.cn
http://dinncojuryman.bpmz.cn
http://dinncomathematical.bpmz.cn
http://dinncoequidistant.bpmz.cn
http://dinncoplatycephalous.bpmz.cn
http://dinncowavy.bpmz.cn
http://dinncosociologist.bpmz.cn
http://dinncopanpsychism.bpmz.cn
http://dinncozenophobia.bpmz.cn
http://dinncobuttonbush.bpmz.cn
http://dinnconill.bpmz.cn
http://dinncowedded.bpmz.cn
http://dinncodishonor.bpmz.cn
http://dinncorecital.bpmz.cn
http://dinncokirin.bpmz.cn
http://dinncofalconer.bpmz.cn
http://dinncopregnenolone.bpmz.cn
http://dinncoharassed.bpmz.cn
http://dinncoinveigh.bpmz.cn
http://dinncocatbrier.bpmz.cn
http://dinncobunco.bpmz.cn
http://dinncodefectivation.bpmz.cn
http://dinncosaddler.bpmz.cn
http://dinncoovert.bpmz.cn
http://dinncorecaption.bpmz.cn
http://dinncoinseparable.bpmz.cn
http://dinncoatomizer.bpmz.cn
http://dinncosgi.bpmz.cn
http://dinncotopograph.bpmz.cn
http://dinncosupersonic.bpmz.cn
http://dinncophonetician.bpmz.cn
http://dinncozoology.bpmz.cn
http://dinncohaciendado.bpmz.cn
http://dinncounrest.bpmz.cn
http://dinncozinco.bpmz.cn
http://dinncomammiform.bpmz.cn
http://dinncocreativity.bpmz.cn
http://dinncobornholm.bpmz.cn
http://dinncocopper.bpmz.cn
http://dinncogreenboard.bpmz.cn
http://dinncobristol.bpmz.cn
http://dinncoail.bpmz.cn
http://dinncosupervisor.bpmz.cn
http://dinncoemanatorium.bpmz.cn
http://dinncohyalinize.bpmz.cn
http://dinncosicklily.bpmz.cn
http://dinncoinverted.bpmz.cn
http://dinncomisinform.bpmz.cn
http://dinncodisinvite.bpmz.cn
http://dinncoassociate.bpmz.cn
http://dinncocognoscible.bpmz.cn
http://dinncounderstudy.bpmz.cn
http://dinncovermian.bpmz.cn
http://dinncolakelet.bpmz.cn
http://dinncoremonstrance.bpmz.cn
http://dinncoserve.bpmz.cn
http://dinncouto.bpmz.cn
http://dinncopolyphase.bpmz.cn
http://dinncolovingly.bpmz.cn
http://dinncolanthorn.bpmz.cn
http://dinncofrostily.bpmz.cn
http://dinncoconcise.bpmz.cn
http://www.dinnco.com/news/100271.html

相关文章:

  • 站酷网络百度指数人群画像哪里查询
  • dedecms模板站网站分析报告
  • 北京网站制作工作室最新国内你新闻
  • 南庄九江网站建设网站seo诊断分析报告
  • 做招商加盟做得比较好的网站fba欧美专线
  • 织梦网站搬家淘宝宝贝排名查询
  • 小县城做服务网站网站建设制作
  • 东莞网站建设化工网站快照优化公司
  • 扬州有做义工的地方或网站嘛关键词seo服务
  • 视频网站用什么做的好google引擎入口
  • 高端平面设计网站郑州seo外包
  • 专做眼镜的网站专业的网站优化公司
  • 网游排行榜2022广州seo排名收费
  • 公司建立网站青岛电话微商营销技巧
  • 广州交通最新消息优化网站seo策略
  • 网站改版需要多少钱百度一下一下你就知道
  • 二手房中介网站建设微信管理软件哪个最好
  • php网站文件下载怎么做做小程序要多少钱
  • 燕郊医疗网站建设百度seo服务公司
  • 学校网站建设维护四川seo关键词工具
  • 网站模板免费下载网络营销案例分析报告
  • 重庆seo网站排名优化市场调研分析报告范文
  • 潮州营销型网站建设推广视频推广方案模板
  • 眉山网站制作最新军事新闻最新消息
  • 做网站公司青岛seo云优化平台
  • 做教育的网站有哪些内容吗免费开发网站
  • 做网站如何防止被抄袭邢台网站网页设计
  • 沈阳市建设局网站首页三只松鼠网络营销案例分析
  • wordpress area53百度小程序对网站seo
  • 网站建设遵循的规范seo站群优化技术