pkuseg word segmentation library and its application
1. What is pkuseg
Pkuseg is a new Chinese word segmentation toolkit developed by the language computing and machine learning research group of Peking University. GitHub address: https://github.com/lancopku/pkuseg-python
2. Characteristics
Multi domain word segmentation. Different from the previous general Chinese word segmentation tools, ...
Posted by klapy on Wed, 13 Oct 2021 22:23:03 -0700
Python 3 implementation and improvement of Apriori algorithm
Python 3 implementation and improvement of Apriori algorithm
Code reference machine learning practice The improved methods are partly from data mining: concepts and technologies, and partly from https://blog.csdn.net/weixin_30702887/article/details/98992919 I summarize and implement here, and record my learning of Apriori algorithm
First ...
Posted by narch31 on Wed, 13 Oct 2021 21:22:17 -0700
Python practice - web crawler notes - 2. Crawl public information from the website
Python practice notes - 2. Crawl public information from the website
Zhang Banshu's Python practice notes, including notes and DeBug's experience.
In order to complete a troublesome internship task, I tried to do such a reptile project. One of the task requirements is to search the government affairs disclosure of Anhui emergency management d ...
Posted by JohnResig on Wed, 13 Oct 2021 13:55:32 -0700
❥ ❥ the most complete python operation list in the whole network is coming ❥ ❥ (recommended Collection) (^ -)
Chapter 4 operation list
4.1 traversing the entire list You often need to traverse all the elements of the list and perform the same operation on each element. For example, in a game, you may need to translate each interface element the same distance; For a list containing numbers, you may need to perform the same statistical operation on eac ...
Posted by Rocu on Wed, 13 Oct 2021 08:58:11 -0700
pandas data analysis to awesome tutorial [Full Version]
Data structure of Pandas
Import pandas
from pandas import Series
1,Series
Series is an object similar to a one-dimensional array, which consists of the following two parts:
values: a set of data (darray type)Index: related data index label
1) Creation of Series
There are two creation methods:
(1) Created from a list or numpy array, the ...
Posted by poisedforflight on Wed, 13 Oct 2021 08:51:25 -0700
python crawler - cookie handling of requests module
cookie handling of requests module
Stateless HTTP
HTTP statelessness means that the HTTP protocol has no memory ability for transaction processing, that is, the server does not know what the state of the client is. When we send a request to the server, the server parses the request and then returns the corresponding response. The server is re ...
Posted by ubuntu-user on Wed, 13 Oct 2021 08:37:55 -0700
1+X computer vision textual research some knowledge points
Python
1. Python basic syntax
1.1 Python identifier
In Python, identifiers consist of letters, numbers, and underscores.
In Python, all identifiers can include English, numbers, and underscores (), but cannot begin with a number.
Identifiers in Python are case sensitive.
Identifiers that begin with underscores have special meaning. St ...
Posted by rulinus on Tue, 12 Oct 2021 20:01:30 -0700
Dealing with data imbalance in multi label classification task -- multi label synthetic minority over sampling technology (mlsmote)
When dealing with classification problems, category imbalance is not only a problem we often encounter, but also a problem that often occurs in actual use scenarios. Category imbalance will bring challenges to the prediction task, and lead to poor prediction effect of a few categories, because the hypothetical scenario of most machine le ...
Posted by jacksonpt on Tue, 12 Oct 2021 19:04:53 -0700
Scarpy crawls Dangdang books
catalogue
1: Scarpy
(1) What is Scrapy:
(2) To install the scene:
2. Creation and operation of the scratch project
1. Create a scene project:
2. Proje ...
Posted by Pixelsize on Tue, 12 Oct 2021 18:00:08 -0700
Data processing and visualization -- Fundamentals of Numpy
Data processing and visualization (I) -- Numpy Foundation
1 background
Although the list can complete array operation, it is not an array in the real sense. When the amount of data is large, its speed is very slow, so NumPy extension library is provided to complete array operation. Many advanced extension libraries also rely on it, such as Sc ...
Posted by itarun on Tue, 12 Oct 2021 16:15:04 -0700