Skip to content

Middle School NGSS Resource Hub

Three-dimensional breakdowns, phenomenon ideas, misconceptions, and engagement activities for every NGSS middle school standard.

Chris Kesler
I'm Chris Kesler, a former award-winning middle school science teacher. This is the site I wish I'd had in the classroom. One hub with standard-by-standard breakdowns, three-dimensional learning framings, phenomenon starters, engagement ideas, and resources, all aligned to NGSS.

Middle School NGSS Standards

Pick any standard. Each page is your full lesson-planning workspace for that standard.

MS-LS3: Heredity: Inheritance & Variation of Traits
MS-LS3-1Mutations & Protein Structure MS-LS3-2Asexual vs. Sexual Reproduction
MS-ETS1-3 โ€ข Engineering Design

Analyzing Design Test Data: Cherry-Picking the Best Features Into a Better Solution

The Standard

"Analyze data from tests to determine similarities and differences among several design solutions to identify the best characteristics of each that can be combined into a new solution to better meet the criteria for success."

๐Ÿ“‹ Clarification Statement

NGSS does not list an explicit clarification statement for this standard.

โš ๏ธ Assessment Boundary

NGSS does not list an explicit assessment boundary for this standard.

Three-Dimensional Learning

The three dimensions packed into this standard

Every standard bundles a DCI (the content), a SEP (the science practice), and a CCC (the crosscutting lens). They run in the same task, not in sequence.

DCI โ€ข Content
Two Disciplinary Core Ideas bundle into this standard
ETS1.BDeveloping Possible Solutions

"There are systematic processes for evaluating solutions with respect to how well they meet the criteria and constraints of a problem. Sometimes parts of different solutions can be combined to create a solution that is better than any of its predecessors."

ETS1.COptimizing the Design Solution

"Although one design may not perform the best across all tests, identifying the characteristics of the design that performed the best in each test can provide useful information for the redesign process, that is, some of those characteristics may be incorporated into the new design."

When you test three designs, you rarely get one clear winner. Design A handles load like a champ but costs too much. Design B is cheap but flexes under weight. Design C is light but slow to build. The work isn't picking a winner. The work is reading the data, naming what each one did well, and combining those strengths into a v4 that beats all three.

What a student actually does Compares test results across multiple design solutions, identifies which features performed best on which criteria, and proposes a new solution that combines those strengths.
What this doesn't mean Students don't have to build every combined design they propose. They have to justify the combination with data.
Look for in student work They name specific features from each design and tie those features to specific test results, not vibes.
SEP โ€ข What Kids Do
Analyzing and Interpreting Data
NGSS verbatim

"Analyze and interpret data to determine similarities and differences in findings."

Students aren't picking favorites or going with gut feel. They're analyzing test data, finding the similarities and differences across designs, and using those patterns to decide what to keep, what to drop, and what to combine. The data drives the redesign, not the prettiest prototype.

What a student actually does Reads a data table or chart from multiple design tests, looks for similarities and differences across rows or columns, and uses that comparison to make a redesign decision.
What this doesn't mean No fancy statistics. No standard deviations. Just careful, side-by-side comparison of measurable outcomes.
Look for in student work They reference the data directly: "Design A held 2.1 kg, Design B held 1.4 kg, so the truss shape held more weight." Numbers do the talking.
CCC โ€ข Big Idea Lens
(not specified)
NGSS verbatim

"Engineering standards typically do not specify a single CCC."

Patterns show up across the test results. One design fails the same way every trial. Another excels under one condition and tanks under another. Spotting those patterns is what tells you which features are real strengths and which were lucky one-offs. No pattern, no useful redesign.

What a student actually does Notices patterns across designs. The arched designs all held more. The wider bases all stayed stable. The cheap materials all flexed.
What this doesn't mean A single trial isn't a pattern. Students should look across multiple designs and multiple trials before claiming one.
Look for in student work They name the pattern out loud and use it to justify a feature in the new design.

๐Ÿ“ Where This Standard Fits in the K-12 Progression

Use this to plan the year. Knowing what students should already know and what they're heading toward keeps the lesson focused.

3rd-5th Grade โ€ข Came In Knowing
3-5.ETS1.A; 3-5.ETS1.B; 3-5.ETS1.C

""

โ†’
Middle School โ€ข You Are Here
MS-ETS1-3

Analyzing Design Test Data: Cherry-Picking the Best Features Into a Better Solution

โ†’

๐ŸŒŽ Phenomena for MS-ETS1-3

Anchor the lesson in one puzzling phenomenon kids keep coming back to. Use the two investigative phenomena to sharpen specific facets.

๐Ÿ”ฅ
Anchoring Phenomenon

The Hybrid Car That Took Over the Driveway

Two engineers walk into a problem. One has a gas engine that's powerful but burns fuel. One has an electric motor that's clean but runs out of charge fast. Each design solves part of the problem and fails at another part. Then somebody combined them, put a small gas engine and an electric motor in the same car, and the hybrid was born. Same fuel tank, way more miles. Students will see hybrids on the road all week.

๐ŸŽฏ Driving Question

"How do engineers decide which parts of two different designs to combine, and how do they know the combination will be better than either one alone?"

๐Ÿ’ฌ Questions Students Will Keep Asking
  • "Why didn't they just pick the better one?"
  • "What did they have to give up to make the hybrid work?"
  • "Could you keep combining and combining until you got the perfect car?"
๐Ÿ’ง
Investigative Phenomenon

The iPhone Was Never Just a Phone

Pull up a picture of a 2006 desk: a flip phone, an iPod, a digital camera, a planner, a GPS unit. Then a 2007 iPhone. Same functions, one device. The iPhone wasn't a brand-new invention. It was a combined solution that pulled the best feature from each of those products into one design. Use this one to sharpen the lens the anchor is pushing on: combining beats inventing from scratch.

๐ŸŽฏ Driving Question

"When engineers combined features from five separate products into one, what trade-offs did they have to make for it to work?"

๐Ÿ’ฌ Questions Students Will Keep Asking
  • "If it was just combining stuff, why didn't anyone do it sooner?"
  • "What got worse when they combined? The camera? The battery?"
  • "Are there things on my phone that still don't work as well as the standalone version?"
๐Ÿงช
Investigative Phenomenon

Three Backpacks, One Better Backpack

A student tested three backpacks on the way to school. One had great straps but a tiny main pocket. One held everything but the zipper broke in a week. One was waterproof but heavy. None was great. The student sketched a v4 combining the straps from Bag 1, the layout from Bag 2, and the waterproofing from Bag 3. Same kind of design analysis the anchor demands, only with something every middle schooler has tested in real life.

๐ŸŽฏ Driving Question

"How would you decide which features from three so-so backpacks to combine into a v4 worth carrying?"

๐Ÿ’ฌ Questions Students Will Keep Asking
  • "What if combining the features makes it too heavy to wear?"
  • "How do I know which criterion matters most? Comfort, capacity, or waterproofing?"
  • "Would the combined backpack be cheaper or more expensive to make?"
Free download
All 3 phenomena + discussion prompts as a printable PDF
One page, ready to slide into your lesson folder. The anchor, both investigatives, and ready-to-go discussion prompts.
Download Free PDF

โš ๏ธ Misconceptions Your Students Will Walk In With

These come up almost every year. Knowing them in advance lets you head them off in the first lesson.

ร—

"The best parts of every design will always combine into something better."

โœ“

Not always. Sometimes two strong features cancel each other out. A super-strong material that's also super-heavy might fail a different criterion. Combining is a hypothesis to test, not a guarantee. The data from the next test tells you whether the combination actually worked.

ร—

"More data always gives you more redesign options."

โœ“

Data quality matters more than data quantity. If you only measured load, you have no way to combine for cost or weight. Pick your criteria before you test, measure those criteria carefully, and the analysis has something to work with. Fifty data points on the wrong variable don't help.

ร—

"If my first three designs all failed, the project is a bust."

โœ“

Three failures full of useful data are a great starting point. Maybe one held weight better, one cost less, one was easier to build. The redesign pulls the best piece from each. Failure isn't the opposite of progress in engineering. It's the data you need to make the next version smarter.

ร—

"You should keep testing and redesigning until your solution is perfect."

โœ“

Engineering runs on constraints. Time, money, materials, deadlines. "Perfect" usually means "too expensive" or "too late." The real goal is a solution that meets the criteria well enough inside the constraints. Once it does, ship it. The standard is "better meet the criteria for success," not "achieve perfection."

๐Ÿ™‹ Common Student Questions and How to Respond

These come up almost every time this standard gets taught. Plan a response and you'll keep the lesson focused.

What if all three of my designs are basically the same? How do I combine them?
How I'd respond

Look closer. Even small differences matter. Maybe one used more tape, one had a wider base, one was built faster. Those count as features. If the designs really are identical, the next move is to add variation on purpose: change one variable per design so the comparison has something to chew on.

Do I have to build the combined design or can I just describe it?
How I'd respond

It depends on what your data is for. If we're testing it, we build it. If we're using the analysis to propose a v4 on paper, a labeled sketch with a data-backed defense is enough. Either way, the combined design has to be tied to specific evidence from the earlier tests.

What if combining the best parts makes the design worse?
How I'd respond

That's a real result and worth reporting. Engineers run into this all the time. The strongest material might not pair with the cheapest structure. When a combination flops, the next move is to look at the trade-off, decide which criterion matters more, and pick accordingly. Sometimes the best v4 is closer to one of the originals than a true mash-up.

How do I know which features are 'best' when the criteria conflict?
How I'd respond

Rank your criteria before you analyze. If load capacity matters more than cost, the strongest design's features get priority. If cost rules, the cheapest design's features lead. Engineering isn't math class with one right answer. It's a defended choice based on which criterion you decided mattered most.

๐Ÿ“š Vocabulary Students Need for MS-ETS1-3

Twelve terms students need to access this standard. Definitions in plain-English, classroom-ready language.

Engineering Process
Criteria

The things a successful design must do. Hold a certain weight. Cost under a certain amount. Fit in a certain space. Criteria are the targets.

Constraints

The limits the design has to live inside. Time, materials, budget, size. Constraints are the walls.

Design solution

A specific version of an engineered object built to meet the criteria. v1, v2, v3, and v4 are all design solutions.

Redesign

The next version, built using what was learned from testing the previous one. Redesign is where the data does its work.

Trade-off

A choice where improving one criterion makes another worse. Stronger usually means heavier. Cheaper usually means less durable. Trade-offs force a priority call.

Iteration

One cycle of build, test, analyze, redesign. Engineering is iterative. v4 is the fourth iteration.

Data Analysis
Data

The measurements you collect during testing. Numbers, observations, failure points, costs. Data is what you'll analyze later.

Similarities and differences

What you look for when comparing multiple designs. Where did they perform the same? Where did they diverge? Both matter.

Pattern

A consistent result across multiple designs or multiple trials. If every arched design held more weight, that's a pattern worth using in the redesign.

Best characteristic

A feature of one design that outperformed the same feature in the other designs. Best is always relative to a specific criterion.

Combination

The act of pulling features from multiple designs into one new solution. The combination is the heart of this standard.

Cost-benefit comparison

A side-by-side look at what a design costs (money, time, materials) versus what it delivers (performance). Used to defend a redesign choice.

๐Ÿ’ก Free Engagement Ideas for MS-ETS1-3

๐Ÿ’ก

Three-Bridge Cherry-Pick

Teams build three paper bridges with the same budget (1 sheet of cardstock, 50 cm of tape, same span). Each team uses a different design: beam, truss, arch. They test all three under load (washers in a cup) and record load at failure, where it failed, and total tape used. Then they design a v4 that combines the strongest features from at least two of the three. Sketch and defend with data.

Materials: Cardstock, masking tape, washers, paper cup, metric ruler, data table sheet, sketching paper
๐Ÿ”

Water Filter Mash-Up

Students get specs and customer reviews for three real water filter designs (a pitcher, a faucet attachment, a portable straw filter). They build a comparison chart: filtration speed, cost, portability, replacement filter cost, and one drawback. Then they propose a hybrid design that pulls the best feature from each and explain which trade-off they had to accept. No actual filter is built. The work is in the analysis and the defense.

Materials: Printed spec sheets for 3 filter designs, printed review snippets, comparison chart, sketching paper
๐ŸŽฏ

School Chair Redesign Studio

Teams photograph or sketch three different chairs around the school: cafeteria chair, classroom chair, library chair. They build a criteria chart (comfort, durability, stackability, cost). They rate each chair 1-5 on each criterion based on a 5-minute sit-test. Then they design a v4 school chair combining the best features for "all-day comfort during testing season." Defend with the ratings.

Materials: Clipboards, chair photos or quick sketches, criteria chart printout, sketching paper, optional: ruler for measurements
๐Ÿงฉ

Catapult Combine

Pre-built or student-built mini catapults using three different launch mechanisms (spoon-and-rubber-band, popsicle-stick tension, mousetrap). Students measure launch distance, accuracy (hits within a 30 cm target), and reload time across 5 trials each. Analyze the data. Then sketch a v4 mechanism combining the best features. Optional extension: build it and re-test.

Materials: Pre-made mini catapults (3 styles), small soft projectiles (mini marshmallows or pom-poms), measuring tape, target sheet, data table

๐Ÿ“ Assessment Ideas for MS-ETS1-3

Three short tasks that hit all three dimensions. Doable in one class period each.

Task 1
Read the Data, Build the v4

Students get a pre-made data table showing three bridge designs tested on three criteria (load, cost, build time). They write a paragraph identifying the best feature from each design (tied to the data), then sketch a v4 combining those features. They label the sketch and write a 2-3 sentence defense citing the data.

DCI: ETS1.B, ETS1.C SEP: Analyzing data CCC: Patterns
Task 2
Spot the Combine Mistake

Students get four "combined" design proposals. Two correctly pull features supported by the data. Two pull features that contradict the data (claim a "strong" feature when the data shows that feature failed). Students mark which proposals are data-backed, which aren't, and explain the error citing specific numbers from the table.

DCI: ETS1.B, ETS1.C SEP: Analyzing data CCC: Patterns
Task 3
Defend the Trade-Off

Students get a scenario where the best two features from a test set can't both go into v4 (combining them breaks the budget or exceeds weight limits). They have to pick one, drop the other, and write a defense explaining which criterion they prioritized and why. The grade rests on the defense, not on which feature they picked.

DCI: ETS1.B, ETS1.C SEP: Analyzing data CCC: Patterns

๐ŸŽฏ What Proficient Student Work Looks Like

Same prompt, three student responses at different proficiency levels. Use as anchor papers when scoring.

The Prompt

"Three teams tested paper bridges. Use the data table below to identify the best feature from each design and describe a v4 that combines those features. Defend your choices with the data."

โœ… What I'd Look For in Their Work
  • A specific claim backed by data, observation, or model
  • Use of standard-specific vocabulary in context
  • Connection between the visible and the underlying explanation
  • A question they're still wondering about (curiosity stays alive)
Approaching
โœ๏ธ Student Wrote

Design B held the most weight, so I would build my v4 like Design B. It was the best one. I would also use Design A's paper because Design A was lighter.

๐Ÿ‘€ What I'd Notice

Picks a feature but doesn't tie it to specific data. Treats one design as a winner instead of pulling features from multiple. Doesn't address trade-offs. Stops at "Design B was best."

Meeting
โœ๏ธ Student Wrote

Design B held 2.1 kg, more than Design A (1.4 kg) and Design C (1.7 kg). So I would use Design B's truss shape in my v4. Design A used the least tape (20 cm vs. 50 cm and 35 cm), so I would use Design A's joint style to save material. My v4 would have a truss shape with Design A's joints. The trade-off is that the lighter joints might not hold as much as Design B's original, so I would do another test to check.

๐Ÿ‘€ What I'd Notice

Cites specific numbers. Pulls features from two designs and ties each to a criterion. Names a trade-off. Proposes a follow-up test. Hits exactly what the standard is targeting.

Exceeding
โœ๏ธ Student Wrote

Looking at the data, no single design won every category. Design B held the most weight (2.1 kg) because of its truss shape. Design A used the least tape (20 cm) because of its tight folded joints. Design C failed at the lowest load (1.7 kg) but recovered fastest after the weight was removed, suggesting flexible paper. My v4 combines Design B's truss shape, Design A's folded joint style, and Design C's lighter paper grade. The pattern I noticed is that truss shapes held more in every trial. The trade-off I had to accept is that Design C's lighter paper might fail earlier than Design B's heavier paper. I prioritized weight savings over maximum load because the criteria sheet ranked 'lightweight' above 'maximum load.' I would build the v4 and re-test against the same load protocol to confirm.

๐Ÿ‘€ What I'd Notice

Quantitative citations throughout. Pulls features from all three designs, each justified by a different criterion. Identifies a pattern across designs (truss = more load). Names the trade-off explicitly. Defends the priority call by referencing the criteria ranking. Proposes the next iteration. This is the analyze-combine-defend chain the standard targets.