
group and enhance the competitiveness of enterprises. The
main work of this paper is to improve the K-means clustering
algorithm and integrate a single clustering algorithm with the
idea of ensemble learning, and then apply the clustering
ensemble algorithm to customer segmentation of cross-border
e-commerce.
Generally speaking, customer segmentation can be carried
out according to the following three customer attributes [12]:
(1)External attribute
For example, the geographical distribution of customers, the
products owned by customers, and the organizational
ownership of customers-enterprise users, individual users,
government users, etc. This kind of stratification is usually the
simplest and most intuitive, but at the same time it is also a
relatively extensive classification. We still don't know which
customers contribute more to the enterprise and those
customers contribute less to the enterprise at every customer
level.
(2)Intrinsic attribute
Intrinsic attributes are attributes determined by the internal
factors of customers, such as gender, age, beliefs, hobbies,
income, family members, credit, personality and value
orientation, etc.
(3)Characteristics of consumption behavior
According to the consumer behavior, we can master the real
consumer habits and tendencies of customers, and usually get
ideal results in practice. However, classification according to
consumption behavior also has its limitations. It can only be
applied to existing customers. For potential customers, because
consumption behavior has not yet started, of course,
classification is impossible.
Customer segmentation can generally be divided into five
steps (as shown in Figure 1):
Figure 1 Customer segmentation steps
(1)Subdivision of general characteristics of customers
In order to classify customers according to these
characteristics, the main factors that should be considered are:
regional characteristics, such as urban or rural areas, urban
scale and urban economic development level; Education
background, such as age, sex, education level, nature of work
unit, position or level; Psychological factors, such as
personality characteristics, moral development level, etc.
(2)Customer value segmentation
Customers' contribution to enterprises is different according
to their own consumption level. Therefore, after customers are
subdivided according to their general characteristics, they
should be divided into several grades according to their
contribution to the enterprise, such as high-quality customers,
potential customers, general customers, small customers and
blacklist customers, etc.
(3)Customer common demand segmentation
On the basis of the first two steps of subdivision, select the
high-quality customers and potential high-quality customers in
the enterprise as the target. Analyze the demand characteristics
of all kinds of customers, and formulate enterprise strategies
under the guidance of customer demand, and finally provide
personalized products and services for each customer group.
(4)Select a clustering method suitable for enterprise data
characteristics
Clustering algorithm is an unsupervised learning algorithm.
When using clustering technology to subdivide customers, we
should choose the appropriate algorithm according to the needs
of enterprises, the characteristics of customers and the collected
data, so as to mine and discover the true distribution of data.
(5)Evaluate the customer segmentation model
The purpose of customer segmentation model is to divide
customers into different clusters according to their various
characteristics. According to the needs of enterprises,
customers in the same cluster should have similar contribution
and consumption tendency, while customers in different
clusters should try to be different in these aspects. These
characteristics can be measured according to the mean and
variance of customer attributes.
Clustering is a main technology in data mining. The process
of grouping a set of objects into multiple classes composed of
similar objects is called clustering [13-14]. After grouping, the
objects in the same class are similar, but the objects in different
classes are different. At the same time, cluster analysis is often
used as the first step of data mining, which preprocesses the
data, and then uses other algorithms to further analyze the
obtained classes. Clustering algorithm can be divided into
partition method, hierarchical method, density-based method,
grid-based method and model-based method.
K-means algorithm is one of the clustering algorithms based
on partition. It uses an iterative climbing method to discover
clusters and cluster centers from unlabeled data sets. Its
purpose is to divide
samples into
clusters, so that the
sum of squares of errors between the data samples in each
cluster and the mean value of this cluster is the smallest.
2. General Method and Process of
Customer Segmentation
3. Research Method
3.1 Clustering Analysis Algorithm
PROOF
DOI: 10.37394/232020.2022.2.17