top of page
  • Writer's pictureGourav S

CSE Prelims Results - What's in a name?

Does being a "Sharma ji ka beta"increase your chance of clearing CSE? If we go by data, then the answer is (partly) yes!



But, not more than that of Kumar Ji ka beta, Singh ji ka beta or Meena ji ka beta!


Also, if we go by statistics, having your name "Abhishek" definitely helps.

Keep it simple!


The analysis has been done on Namewise CSE prelims 2022 result available on upsc.gov.in

WR-CSP-22-engl-Namelist-220622-3
.pdf
Download PDF • 1.27MB

This Sunday, I decided to conduct a analysis of the CSE Prelims result. When I started I was planning to leverage simple NLP tools like TF-IDF to assess the relative importance of each term and then something on the lines of vectorisation to establish relationship between strings of texts


However, I quickly realised that even a simple excel analysis focusing on frequency analysis can be fun and can bring out interesting insights. As the Einstein said -


Any intelligent fool can make things bigger and more complex. It takes a touch of genius - and a lot of courage - to move in the opposite direction.

I quickly separated the First name, middle name and last name from the list and used pivot table to create a frequency table. A layman can understand this as using Crtl+F to count how many times each search term (which can be first name, last name or even middle name) is appearing


The findings - Last/Middle Name


Here is the top 10 most common frequencies

Text String

Frequency

Kumar

1777

Singh

986

Meena

365

Sharma

281

S

279

Yadav

266

Abhishek

203

​Gupta

198

Agarwal/Aggarwal/Agrawal/ Agrwal

168

Mishra

166

Jain

166

​Shubham

161

Rahul

151

  • There are 1777 "Kumar" out of 13,000 selected aspirants i.e. 1 out of every 7 aspirants

  • "Singh" appeared 986 times.

As both Kumar and Singh are generic surnames as they are commonly used as middle names. I remember, I was "Gourav Kumar Sharma" for around one academic year when I decided to change it myself to shorter "Gourav Sharma" and Gourav S later on.

  • The third most common string was "Meena" with 365 appearances.

  • Now comes, Sharma ji ka betas/betis with 288 candidates having "Sharma" in their name.

  • S can be due to people using it as middle name or people shortening to surname e.g. Gourav S

  • Then comes Gupta and Agarwal. Due to multiple spellings, I decided to club 4 most similar spellings (Yes, excel allows you to do that) - Agarwal/Aggarwal/Agrawal/ Agrwal. "Gupta" appeared 198 times and Agarwal/variants appeared 166 times.

  • However, owing to my limited knowledge of castes I couldn't group castes with different surname.

  • Next comes Mishra with frequency of 166.

  • What if combine most commonly occurring Sharma and related terms - Mishra, Pandey, Tiwari, Shukla, Dwivedi etc. The number comes up to 788!


The findings - First Name


The most common first names are

  1. Abhishek - 203

  2. Shubham - 161

  3. Rahul - 151

  4. Raj - 129

  5. Prakash - 122

  6. Aditya - 111

  7. Ankit - 102

  8. G̶o̶u̶r̶a̶v̶ Gaurav - 99

  9. Saurabh - 92

  10. Ashish - 92

  11. Akshay - 92

  12. Amit - 91

The findings - Full Name


If we combine above results, we can get to the answer

Yes, the most common name in the list was "Abhishek Kumar"

Scope of Improvement

  1. Any socio-religious analysis has been avoided here due to lack of expertise on author's part.

  2. As government jobs are viewed as barometer of social progress in India, this can be used to analyse empowerment status of various social groups in India.

  3. Relative frequency i.e. frequency divided by the population of a "search term" would provide a better analysis to show relative deprivation/empowerment

  4. Temporal analysis through combining data from last 10-15 years.

  5. Due to (assumed) more uniformity in North Indians terms/names in comparison to other parts, the analysis is a little biased towards North Indian names.

People looking to collaborate on further research can mail me at gs@gouravs.com


Disclaimer

Correlation is not causation.

This post in now way encourages you to change your name to Abhishek Kumar or Shubham Kumar. Author holds no liability for someone failing to clear prelims even after changing names.

1,745 views0 comments

Recent Posts

See All
bottom of page