
The Cultural, Ethnic and Linguistic Classification of Populations and Neighbourhoods using Personal Names

(March 2007)


ianturton's tags for this article

census classification data-mining ethnicity names population

Abstract

There is a growing need to understand the nature and detailed composition of ethnic groups in today’s increasingly multicultural societies. Ethnicity classifications are often hotly contested, but still greater problems arise from the quality and availability of classifications, with knock-on consequences for our ability to subdivide populations meaningfully. Name analysis and classification has been proposed as one efficient method of achieving such subdivisions in the absence of ethnicity data, and may be especially pertinent to public health and demographic applications. However, previous approaches to name analysis have been designed to identify one or a small number of ethnic minorities, and not complete populations. This working paper presents a new methodology to classify the UK population and neighbourhoods into groups of common origin using surnames and forenames. It proposes a new ontology of ethnicity that combines some of its multidimensional facets: language, religion, geographical region, and culture. It uses data collected at very fine temporal and spatial scales, and made available, subject to safeguards, at the level of the individual. Such individuals are classified into 185 independently assigned categories of Cultural, Ethnic and Linguistic (CEL) groups, based on the probable origins of names. We include a justification of the need to classify ethnicity, a proposed CEL taxonomy, a description of how the CEL classification was built and applied, a preliminary external validation, and some examples of current and potential applications.

