Python 100 line code easily crawls the 100G set diagram of Meizi net. I hope you have enough memory on your network disk [with source code]
preface
Recently, I was doing monitoring related supporting facilities, and found that many scripts are based on python. I heard a long time ago that its name is short of life. I learned python, which is not a joke. With the rise of artificial intelligence, machine learning and deep learning, most of the AI code on the market is written in Pyt ...
Posted by miasma on Tue, 26 May 2020 00:19:00 -0700
Challenge to crawl 100 videos per minute. As long as the network speed is fast enough, anti crawl will not catch up with me
preface
Most of the good videos are short videos! The same interface returns different videos to users
Today, I will take you down the video recommended by the system!
Knowledge points
1. Dynamic data capture demonstration
2. json data analysis method
3. Video data saving
Environment introduction
python 3.6
pycharm
requests
json
General thinkin ...
Posted by nvidia on Wed, 13 May 2020 06:56:07 -0700
manjaro Configuration Overview
1 Source
Arrange the speed of Chinese mirrors, select one.
sudo pacman-mirrors -i -c China -m rank
/etc/pacman.conf plus:
[archlinuxcn]
SigLevel=Optional TrustedOnly
Server=https://mirrors.ustc.edu.cn/archlinuxcn/$arch
To update:
sudo pacman -Syy && sudo pacman -S archlinuxcn-keyring
2 Input Method
sudo pacman -S fcitx-im fcitx-confi ...
Posted by geroido on Sat, 09 May 2020 19:53:15 -0700
50 lines of code for you! Hand to hand teaching you how to make the video barrage into the desired shape
Preface
Station B as a barrage video website, has the so-called barrage culture, so let's see next, what is the most barrage in a video?
Knowledge points:
1. Basic process of reptile
2. Regular
3. requests
4. jieba
5. csv
6. wordcloud
Development environment:
Python 3.6
Pycharm
Python section
Step:
import reimport requestsimport csv
1. Det ...
Posted by hmmm on Tue, 05 May 2020 07:32:56 -0700
Python - modules and packages (common modules)
Modular
In Python, a. py file is called a module
Greatly improves code maintainability
You don't have to start from scratch to write code. When a module is written, it can be referenced elsewhere
vim mymod.py
import mymod
mymod.hello()
mymod.world()
package
What if different people write the same module name? ...
Posted by luke101 on Mon, 04 May 2020 09:22:38 -0700
Sesame HTTP: the pit of scikit learn Bayesian text classification
Basic steps:
1. Training material classification:
I refer to the official directory structure:
Put the corresponding text, a txt file and a corresponding article in each directory: as follows
It should be noted that the proportion of all materials should be kept at the same proportion (adjusted according to the training results, the ...
Posted by KevinCB on Sat, 02 May 2020 13:36:22 -0700
Sesame HTTP: setup of crawler Selenium+Chrome agent
Micro blog login limits the number of errors... In addition, a large number of Cookie accounts are blocked and need to be removed from the Cookie pool... Need to use a proxy... Rogue Baidu has been special for most of the day??? It turns out that Google can solve the problem in minutes? What else can baidu do besides sell fake medicine
The Sele ...
Posted by Arbitus on Sat, 02 May 2020 01:10:23 -0700
Developing Restful api based on Django framework
In the last article, we explained the main concepts of restful api to let you have a preliminary understanding. In this article, we will learn to develop the interface of restful architecture through the Django framework of python. 1, Development environment Development language: Python 3.6IDE :PycharmFramework: Django 2.2.12Test tool: Postman ...
Posted by Sonic_Rage on Sat, 25 Apr 2020 23:25:04 -0700
Python reptile actual battle: hands-on teaching you to crawl agricultural product data (with code)
Preface
The text and pictures of the article are from the Internet, only for learning and communication, and do not have any commercial use. The copyright belongs to the original author. If you have any questions, please contact us in time for handling.
Crawler's website: Wanbang international group. Founded in 2010 and headquartered in Zhen ...
Posted by icedude on Sat, 25 Apr 2020 08:53:46 -0700
Configuration and beautification of Ubuntu after installation
Configuration and Beautification after Ubuntu Installation (2)
The previous article described a series of basic configurations after the installation of ubuntu, which are already available for daily use. This article describes the installation of IDE and some other configurations.
1. Install SSR
Download SSR Client
git clone https://githu ...
Posted by tbare on Tue, 21 Apr 2020 09:23:20 -0700