How-to

Aggregating Survey Data in Excel — 5 Steps from CSV to Chi-Square Test

A practical 5-step workflow for downloading CSV from your survey tool and aggregating it in Excel. From UTF-8 encoding fixes and PivotTable-based GT/cross-tabs to the CHISQ.TEST function for chi-square testing, charting, and team sharing — complete one report in 30 minutes.

Downloading a CSV from your survey tool, aggregating it in Excel, and turning it into a report — this is the most frequent task in research operations, yet teams routinely lose 1–2 hours to PivotTable field misconfigurations, UTF-8 encoding issues, and CHISQ.TEST argument errors. This article organizes the workflow into 5 steps you can complete in 30 minutes, from CSV import to GT aggregation, cross-tabs, chi-square testing, and charting.

Each step includes "common failure points" and links to deep-dive articles for further reading.

Step 1: Import CSV into Excel (UTF-8 encoding fix)

The first hurdle when handling CSV in Excel is character encoding issues with non-ASCII text. If you double-click a CSV file to open it directly, files saved as UTF-8 without BOM will display as "?" or random garbled characters. Always open via "Get & Transform Data" instead.

Steps:

  1. Launch Excel → "Data" tab → select "From Text/CSV"
  2. Select your CSV file
  3. File Origin: specify "65001: Unicode (UTF-8)"
  4. Delimiter: select "Comma"
  5. Data Type Detection: "Based on entire dataset" recommended
  6. Click "Load"

Common failure point: People double-click to open directly, see garbled text, re-download, and double-click again — falling into an infinite loop. Make "Get & Transform Data" your default habit from the start.

For details, see the complete survey data cleaning guide, which covers post-import invalid-response detection (straight-liners / speeders) in practice.

Step 2: Build GT Aggregates with a PivotTable

GT aggregation (grand total) is the basic task of producing "response counts and percentages for each option per question." A PivotTable does this in one shot.

Steps:

  1. Select your data range → "Insert" → "PivotTable" → "OK"
  2. Drag the question column you want to aggregate to the "Rows" area
  3. Drag the same question column to the "Values" area (auto-counts entries)
  4. Value Field Settings → "Show Values As" → "% of Column Total" for percentage display
  5. Right-click "Row Labels" → "Sort" descending by count

Common failure point: When multiple-answer (MA) questions are stored in a single column with comma separators, PivotTable aggregation breaks. You need to either pre-split into multiple columns via "Data" → "Text to Columns," or check your tool's MA export settings (e.g., Kicue) to output split columns.

Step 3: Build Cross-Tabs (Attribute × Question)

Cross-tabulation looks at the relationship between two questions and is the most commonly used aggregation for research decision-making. Example: "How does satisfaction differ by age group?"

Steps:

  1. Create a new PivotTable (same data range)
  2. "Rows": attribute question (e.g., age group)
  3. "Columns": rating question (e.g., satisfaction)
  4. "Values": count (drag any column and select "Count")
  5. Value Field Settings → "Show Values As" → "% of Row Total" for row % display

This reveals "satisfaction distribution by age group" as percentages.

Common failure point: Reading too much into % for cells with small N (N < 30). "80% satisfaction among 40-somethings" sounds impressive, but if N=5 the margin of error is huge and it's not a basis for decisions. Always display N alongside % as a non-negotiable habit.

For details, see Survey Aggregation and Significance Testing — Cross-Tabs, Chi-Square Tests, and Effect Sizes, which organizes the 5 cross-tab patterns to look for.

Step 4: Chi-Square Test (CHISQ.TEST function)

When a cross-tab suggests "there might be a difference," use a chi-square test to determine whether the difference is statistically significant or just random variation. Excel's CHISQ.TEST function handles this.

Steps:

  1. Copy the observed values from your cross-tab to a separate sheet (exclude row/column subtotal cells — body cells only)
  2. Build an expected values matrix of the same size. Each cell: =ROUND(row_total*column_total/grand_total, 2)
  3. Run CHISQ.TEST: =CHISQ.TEST(observed_range, expected_range)
  4. The result (p-value) appears
  5. If p-value is below 0.05, the difference is statistically significant

Example: =CHISQ.TEST(B2:D4, F2:H4) → result of 0.023 means "the satisfaction difference by age group is statistically significant."

Common failure point: Forgetting to calculate expected values and passing two observed-value ranges. CHISQ.TEST requires both an "observed range" and an "expected range" — passing only observed values returns meaningless numbers.

Also, don't judge by p-value alone. With large samples, even tiny differences become significant, so the academic standard is to view it alongside effect size (Cramér's V). For details on effect size calculation and interpretation, again see the Survey Aggregation and Significance Testing guide.

Step 5: Charting and Team Sharing

Pasting aggregate values directly into PPT / Word doesn't communicate well. Choose the optimal chart for each question type and format for reporting.

Optimal charts by question type:

  • Single answer (SA): bar chart (horizontal recommended)
  • Multiple answer (MA): bar chart
  • Likert scale: divergent stacked bar chart
  • Cross-tabs: grouped bar chart / mosaic plot
  • Time series: line chart

Minimum formatting requirements:

  • Always include axis labels, legend, title, and data source
  • Annotate each chart with N=
  • Limit colors to 3 or fewer (including background)

Common failure point: Using pie charts with 5+ segments. They become visually hard to compare and increase cognitive load for the reader. For more than 5 segments, bar charts are the standard.

For details, see the survey result visualization guide, which organizes optimal charts by question type and the 5 dangerous patterns to avoid.

Editorial Perspective — 3 Tips to Streamline Excel Aggregation

From the standpoint of continuously tracking industry cases and field practitioner voices, here are 3 tips to streamline Excel aggregation.

  1. Build template sheets: Excel-template your GT aggregation, cross-tab, and chi-square skeletons, and for next projects only swap the data. Building from scratch each time turns 30 minutes into 2 hours.
  2. Never edit the raw data sheet: Perform calculations and charts on separate sheets, and keep raw data as backup. Accidentally overwriting raw data leads to unrecoverable accidents that happen all too often.
  3. Always annotate aggregates with "N=": "60% satisfied" is far less useful than "60% satisfied (N=120)" for the reader's judgment. Small-N results invite misinterpretation, so noting N is minimum-viable literacy.

CSV Export from the Survey Tool Kicue

Kicue's CSV export is formatted to ride smoothly on the Excel aggregation flow in this guide:

  • UTF-8 encoding: A structure that minimizes encoding issues during Excel import
  • Question text in column headers: Human-readable from row 1, no need to cross-reference question IDs separately
  • Respondent ID column: Useful for joining attributes in cross-tabs and extracting follow-up interview candidates (specific segment respondent extraction)
  • Independent columns per question: Structure that allows direct PivotTable aggregation

The output format for multiple-answer (MA) questions depends on settings at question design time, so in some cases column splitting via Text to Columns as in Step 2 is needed before PivotTable aggregation. The steps in this article handle this.

Note that chi-square testing and effect size calculation are realistically performed using Excel's built-in functions / R / Python / SPSS / JASP, and Kicue itself does not provide statistical analysis functionality. It's designed around a CSV-export-to-external-tool workflow.

Summary — 5 Steps in 30 Minutes

This gets you to the foundation of a report in 30 minutes. The next steps to implement are templatization and raw-data protection in your operational design.


If you want to export CSV in a format that's easy to aggregate in Excel, try the free survey tool Kicue. UTF-8 encoded CSV that's friendly to Excel imports, structured data with respondent ID columns, and independent columns per question — you can start the Excel aggregation 5 steps in this guide from a single account (advanced statistical analysis, effect size calculation, and Joint Display creation require integration with R / Python / SPSS / JASP).

References (1)

Related articles

Ready to create your own survey?

Upload your survey file and AI generates a web survey form in 30 seconds.

Get started for free