Data Analysis with Python Projects - Demographic Data Analyzer

**Tell us what’s happening: Pandas will not import

Describe your issue in detail here.

Every time I run the code on this challenge it says there is an error with the code on the main file on line 2 “import demographic_data_analyzer”.

Finding a way to solve this, I tried deleting line 1 “import pandas as pd” on the demographic_data_analyzer file. Which to my surprise worked and the code ran with no error, however the point of the challenge is to use pandas so I’m really stuck on what to do now. As i wont be able to complete the challenge without pandas.

Your code so far

#This is the unedited code I was given

import pandas as pd

def calculate_demographic_data(print_data=True):
# Read data from file
df = None

# How many of each race are represented in this dataset? This should be a Pandas series with race names as the index labels.
race_count = None

# What is the average age of men?
average_age_men = None

# What is the percentage of people who have a Bachelor's degree?
percentage_bachelors = None

# What percentage of people with advanced education (`Bachelors`, `Masters`, or `Doctorate`) make more than 50K?
# What percentage of people without advanced education make more than 50K?

# with and without `Bachelors`, `Masters`, or `Doctorate`
higher_education = None
lower_education = None

# percentage with salary >50K
higher_education_rich = None
lower_education_rich = None

# What is the minimum number of hours a person works per week (hours-per-week feature)?
min_work_hours = None

# What percentage of the people who work the minimum number of hours per week have a salary of >50K?
num_min_workers = None

rich_percentage = None

# What country has the highest percentage of people that earn >50K?
highest_earning_country = None
highest_earning_country_percentage = None

# Identify the most popular occupation for those who earn >50K in India.
top_IN_occupation = None

# DO NOT MODIFY BELOW THIS LINE

if print_data:
    print("Number of each race:\n", race_count) 
    print("Average age of men:", average_age_men)
    print(f"Percentage with Bachelors degrees: {percentage_bachelors}%")
    print(f"Percentage with higher education that earn >50K: {higher_education_rich}%")
    print(f"Percentage without higher education that earn >50K: {lower_education_rich}%")
    print(f"Min work time: {min_work_hours} hours/week")
    print(f"Percentage of rich among those who work fewest hours: {rich_percentage}%")
    print("Country with highest percentage of rich:", highest_earning_country)
    print(f"Highest percentage of rich people in country: {highest_earning_country_percentage}%")
    print("Top occupations in India:", top_IN_occupation)

return {
    'race_count': race_count,
    'average_age_men': average_age_men,
    'percentage_bachelors': percentage_bachelors,
    'higher_education_rich': higher_education_rich,
    'lower_education_rich': lower_education_rich,
    'min_work_hours': min_work_hours,
    'rich_percentage': rich_percentage,
    'highest_earning_country': highest_earning_country,
    'highest_earning_country_percentage':
    highest_earning_country_percentage,
    'top_IN_occupation': top_IN_occupation
}

Your browser information:

User Agent is: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/115.0.0.0 Safari/537.36

Challenge: Data Analysis with Python Projects - Demographic Data Analyzer

Link to the challenge:

Can you link to your replit?

Sorry forgot to include that, here it is:

When I run your code I get the error:

ModuleNotFoundError: No module named ‘pandas’

At the Shell command prompt type pip install pandas

I think that’s worked. Thank you!

1 Like

Just make sure to put back import pandas as pd :+1:

Hi again! Think I’m having a similar issue were an error pops up saying “line 2 import demographic_data_analyzer” on the main file. I don’t know what I did as it was working fine for a good while after I installed the pandas module. I tried refreshing the page and installing pandas again maybe but didn’t solve the problem.

The link to my replit is below:

Thank you :slight_smile:

Read the whole error message, relevant part is at the bottom.

python3 main.py
Traceback (most recent call last):
File “/home/runner/e4PUH2Pl9vv/main.py”, line 2, in
import demographic_data_analyzer
File “/home/runner/e4PUH2Pl9vv/demographic_data_analyzer.py”, line 22
higher_education = df[df[‘education’].isin([‘bachelors’,‘masters’,‘doctorate’])]
IndentationError: unexpected indent
exit status 1

This traces the problem back through steps. Read it backwards really, from the bottom up.

Important part is here:

File “/home/runner/e4PUH2Pl9vv/demographic_data_analyzer.py”, line 22
higher_education = df[df[‘education’].isin([‘bachelors’,‘masters’,‘doctorate’])]
IndentationError: unexpected indent

This topic was automatically closed 182 days after the last reply. New replies are no longer allowed.