pkuseg word segmentation library and its application

1. What is pkuseg Pkuseg is a new Chinese word segmentation toolkit developed by the language computing and machine learning research group of Peking University. GitHub address: https://github.com/lancopku/pkuseg-python 2. Characteristics Multi domain word segmentation. Different from the previous general Chinese word segmentation tools, ...

Posted by klapy on Wed, 13 Oct 2021 22:23:03 -0700

Python 3 implementation and improvement of Apriori algorithm

Python 3 implementation and improvement of Apriori algorithm Code reference machine learning practice The improved methods are partly from data mining: concepts and technologies, and partly from https://blog.csdn.net/weixin_30702887/article/details/98992919 I summarize and implement here, and record my learning of Apriori algorithm First ...

Posted by narch31 on Wed, 13 Oct 2021 21:22:17 -0700

Python practice - web crawler notes - 2. Crawl public information from the website

Python practice notes - 2. Crawl public information from the website Zhang Banshu's Python practice notes, including notes and DeBug's experience. In order to complete a troublesome internship task, I tried to do such a reptile project. One of the task requirements is to search the government affairs disclosure of Anhui emergency management d ...

Posted by JohnResig on Wed, 13 Oct 2021 13:55:32 -0700

❥ ❥ the most complete python operation list in the whole network is coming ❥ ❥ (recommended Collection) (^ -)

Chapter 4 operation list 4.1 traversing the entire list You often need to traverse all the elements of the list and perform the same operation on each element. For example, in a game, you may need to translate each interface element the same distance; For a list containing numbers, you may need to perform the same statistical operation on eac ...

Posted by Rocu on Wed, 13 Oct 2021 08:58:11 -0700

pandas data analysis to awesome tutorial [Full Version]

Data structure of Pandas Import pandas from pandas import Series 1,Series Series is an object similar to a one-dimensional array, which consists of the following two parts: values: a set of data (darray type)Index: related data index label 1) Creation of Series There are two creation methods: (1) Created from a list or numpy array, the ...

Posted by poisedforflight on Wed, 13 Oct 2021 08:51:25 -0700

python crawler - cookie handling of requests module

cookie handling of requests module Stateless HTTP HTTP statelessness means that the HTTP protocol has no memory ability for transaction processing, that is, the server does not know what the state of the client is. When we send a request to the server, the server parses the request and then returns the corresponding response. The server is re ...

Posted by ubuntu-user on Wed, 13 Oct 2021 08:37:55 -0700

1+X computer vision textual research some knowledge points

Python 1. Python basic syntax 1.1 Python identifier In Python, identifiers consist of letters, numbers, and underscores. In Python, all identifiers can include English, numbers, and underscores (), but cannot begin with a number. Identifiers in Python are case sensitive. Identifiers that begin with underscores have special meaning. St ...

Posted by rulinus on Tue, 12 Oct 2021 20:01:30 -0700

Dealing with data imbalance in multi label classification task -- multi label synthetic minority over sampling technology (mlsmote)

When dealing with classification problems, category imbalance is not only a problem we often encounter, but also a problem that often occurs in actual use scenarios. Category imbalance will bring challenges to the prediction task, and lead to poor prediction effect of a few categories, because the hypothetical scenario of most machine le ...

Posted by jacksonpt on Tue, 12 Oct 2021 19:04:53 -0700

Scarpy crawls Dangdang books

catalogue 1: Scarpy           (1)   What is Scrapy:            (2) To install the scene: 2. Creation and operation of the scratch project          1. Create a scene project:          2. Proje ...

Posted by Pixelsize on Tue, 12 Oct 2021 18:00:08 -0700

Data processing and visualization -- Fundamentals of Numpy

Data processing and visualization (I) -- Numpy Foundation 1 background Although the list can complete array operation, it is not an array in the real sense. When the amount of data is large, its speed is very slow, so NumPy extension library is provided to complete array operation. Many advanced extension libraries also rely on it, such as Sc ...

Posted by itarun on Tue, 12 Oct 2021 16:15:04 -0700