Skip to main content

Value the pull, not just the peak

Generated from content/lms/engine-and-powertrain/03-test-before-you-tune/04-value-dynamic-testing-not-just-peak-numbers.md; edit the source file, not this page.

Source path: content/lms/engine-and-powertrain/03-test-before-you-tune/04-value-dynamic-testing-not-just-peak-numbers.md

Course: Engineer the torque path from engine to pavement

Module: Test before you tune

Estimated duration: 55 minutes

Lesson goal

You are not learning how to brag about the highest horsepower number on a printout. You are learning how to use a dyno pull as test evidence. That means you care about the whole run: how repeatable it is, what conditions produced it, how the engine behaves through the rpm range you actually use, whether the instrumentation is fast enough for the question, and whether the result says something you can act on before you spend track time chasing a guess.

The practical rule is simple: a peak number is only one sample from a larger test. The pull is the evidence. A valid pull tells you how torque and speed behaved over time, under known conditions, on a known rig, with enough repeatability that a before-and-after comparison means something. If the test cannot do that, the peak number may still be entertaining, but it is weak evidence.

This matters because engine power is not a static object. Power is the product of torque and speed, and both signals fluctuate. The engine never runs totally steadily, and the torque and speed signals coming from the dyno fluctuate too. An instantaneous value can differ from a longer-term average, so sampling time and averaging are not clerical details. They are part of the test design. If you stare only at the highest number, you can crown a momentary spike, a noisy sample, or a condition that will not repeat when you actually drive the car.

The older racer wisdom is still useful here. Engine development should ideally happen on an engine dyno before the engine goes into the car, and a chassis dyno can be a useful practical tool for production-based cars. Even simple straightaway acceleration runs provide real-world information, but a dyno saves time and improves precision because it lets you ask the same question under more controlled conditions than a track session usually allows. The mistake is treating the dyno as a scoreboard instead of an experiment.

What the peak hides

A peak number hides four things that matter to a driver.

First, it hides repeatability. If the best pull is higher but the next two pulls fall back, you do not yet have an improvement. You have a best event. The corpus is explicit that speed and torque signals fluctuate and that sampling time is an experimental choice. For an intermediate driver or club racer, that means your first job is not to celebrate the highest line on the page. Your first job is to ask whether the test can reproduce the same answer when nothing meaningful has changed.

Second, it hides the baseline. A chassis dyno comparison becomes useful when you have tested the car on the same dyno and under approximately the same temperature and humidity conditions. Then you can see what the modification did. Without that baseline, the number may say more about the rig and the day than about the engine. The actual figure is less important than the increase that has been realized under comparable conditions.

Third, it hides dynamic behavior. Some engine problems only appear when the engine is asked to change state. A gear change involves throttle closing, an engine speed change that may be around 2000 rev/min, and then reapplication of power. A steady-state number at one load point can look fine while the car still feels lazy, ragged, or inconsistent when you shift and ask for drive again. Transient testing exists because a real engine in service does not live forever at one neat point on a chart.

Fourth, it hides instrumentation limits. Exhaust-gas instruments used for steady work tend to be accurate, sensitive, stable, well damped, and relatively slow. Instruments used for true transient work give up some absolute accuracy because they must respond quickly. Exhaust sampling locations also introduce time and distance lags. If you are trying to diagnose a rapid event, a slow or delayed signal can make the trace look late, soft, or misleading. The pull is not just the engine. It is the engine plus the rig plus the sensors plus the sampling choices.

The principle: make the test answer the driving question

Before you tune, state the driving question in a way the test can answer. Do not ask whether the car made a heroic number. Ask what you need to know.

If you changed jetting, ignition settings, or another final tuning variable, your question may be: did this change improve the engine under the conditions and rpm range I am going to use? The chassis dyno example from the corpus is exactly that practical. You put the car on the rollers, hook up instrumentation, go through the gears into the rpm range you want to use, and read the output. The result matters because it can keep you from arriving at the track and finding out the car is not giving everything it is capable of.

If you changed an engine control strategy, your question may be different: did the advance or mixture curve improve in the part of the run that was actually weak? A broad trick, such as moving a sensor signal to change mixture or timing across the board, is a crude last-ditch measure. Dyno tests are more likely to reveal that the engine needs complex changes in the advance or mixture curve. That is a different conclusion from the engine simply needs more of everything.

If you are working with a turbocharged engine, the question may be dynamic: what happens when you suddenly demand more power? The source material points out that turbocharged engines face a special control problem under sudden demand. Air flow cannot always rise ahead of fuel flow unless special means are available, so acceleration fuel control becomes a compromise among performance, driveability, and exhaust cleanliness. A peak number at stabilized speed does not tell you enough about that compromise.

If you are trying to model gear-change behavior, the question is more severe: can the test sequence accurately model what the engine sees in service, and can the sequence repeat precisely? The transient-test engineer has exactly those two challenges. The dyno must be capable of following the event quickly enough, and during clutch disengagement it may need to impose zero torque while following the free engine speed. That is much more demanding than making one full-throttle sweep and printing a single maximum value.

The technique: how to read a dyno pull as evidence

Start by writing the test question. Keep it short and mechanical. Examples: confirm whether the ignition change improves the usable rpm range; verify whether richer jetting helps without hurting the rest of the pull; check whether the turbo engine recovers cleanly after a sudden power demand; compare the new calibration with the last known good baseline. If you cannot state the question, you will be tempted to accept whichever number flatters the build.

Next, choose the rig that matches the question. An engine dynamometer is the ideal tool for engine development because it isolates the engine before installation. A chassis dyno is more available and can be used to good effect on moderately powerful production cars, especially for final tuning. Portable dynos that bolt to the bellhousing or output shaft can also teach a racer a lot faster than cutting, trying, and measuring lap times. But each rig has limits. Roller horsepower measurements can be limited by tire slip, which makes some chassis dyno results less accurate. That does not make the chassis dyno useless. It means you treat it as a comparative tool and respect the torque path between the crankshaft and the measurement.

Then preserve the baseline. Use the same dyno when you can. Keep temperature and humidity approximately comparable. Record the gear or operating method, the rpm range, the warm-up state, the tire and roller condition if you are on a chassis dyno, and the instrumentation used. If the facility can tell you the sampling or smoothing settings, write those down too. You are trying to keep the test from changing while the engine change is what you are evaluating.

Now make the first pulls serve repeatability, not ego. If the baseline pulls do not agree closely enough to give you confidence, stop interpreting. Find the reason. It may be heat, tire slip, a loose strap, inconsistent throttle application, a sensor issue, or simply a test method that is not settled yet. The source material gives the governing idea: torque and speed fluctuate, and the choice of sampling time and averaging is a compromise. You cannot make a good tuning decision from a noisy foundation.

Once the baseline is credible, evaluate the whole pull. Look at where the engine is better, where it is the same, and where it is worse. A modification that adds a small peak at the top but weakens the rpm range you use after each upshift may be a bad driving change. A modification that does not win the headline peak but makes the run cleaner, more repeatable, and stronger where the car actually operates may be the correct choice. The corpus does not give permission to invent a magic area-under-the-curve formula here, so keep the decision practical: compare the tested rpm range you intended to use, not the single highest sample.

Finally, connect the curve to the control inputs and sensors. If the dyno shows the need for complex advance or mixture changes, do not pretend a crude global trick is precision tuning. A programmable engine control unit may be justified because it can shape the curve instead of moving the whole map blindly. The useful sensor list is not decorative. Barometric pressure, fuel pressure, coolant temperature, ambient air temperature, fuel temperature, exhaust gas temperature, lambda sensing, knock sensing, fuel flow, and driveline torque can all matter depending on the question. The more specific the test question, the more specific the sensor evidence should be.

Sub-skill 1: separate comparison from measurement

Measurement asks what number the dyno displayed. Comparison asks what changed between two controlled tests. For club-level tuning, comparison is usually the stronger use of the dyno.

This is especially true on a chassis dyno. The chassis dyno includes tires, rollers, driveline losses, strapping, and sometimes tire slip. That makes it a less pure engine instrument than an engine dyno. But if you use the same rig and comparable conditions, it can still show whether a modification improved the car. That is why the baseline is central. You are not trying to settle a public argument about the car's absolute horsepower. You are trying to make a better decision about your car.

A useful comparison has three traits. The before condition is known. The after condition changes only what you intended to change. The test method is close enough that the difference is more likely to come from the modification than from the process. When those traits are absent, the printout becomes weak evidence no matter how dramatic the peak looks.

Sub-skill 2: treat sampling as part of the setup

When a dyno reports power, it is working from torque and speed. Neither signal is perfectly still. If the system samples too briefly, it can overvalue a transient spike. If it averages too long, it can flatten a real event. The right sampling choice depends on the question.

For a steady-state check, you may want stable, damped, accurate measurement. For a true transient event, you need speed. The exhaust-analyser example is a clean demonstration. Instruments for steady work must be accurate and stable, and they tend to respond slowly. Instruments for transient work must respond very quickly, preferably in milliseconds, even though they may not reach the same standard of absolute accuracy. If your question is rapid throttle response or gear-change recovery, do not overinterpret a slow signal as if it were an instantaneous truth.

This changes how you talk to the dyno operator. Instead of asking only for the power sheet, ask what the pull method measures well and what it does not. Ask whether the trace is smoothed. Ask how repeatability is judged. Ask whether the instrumentation can see the event you care about. You do not need to become a test-cell engineer, but you do need to know whether the tool is answering your question or merely producing a graph.

Sub-skill 3: respect transient behavior

A track car spends much of its life changing state. You close the throttle, shift, reapply power, ride through torsional response, and ask the engine to accept a new load. The source material treats gear shifts as a characteristic transient-test problem. Profiles vary from very fast race-car automatic changes to older commercial-vehicle changes that may take more than 2 s, with a wide range of driver behavior in between.

For you, the lesson is not that every club car needs a four-quadrant a.c. dynamometer. The lesson is that steady numbers and dynamic behavior are different questions. If the complaint is that the car falls on its face after the shift, surges when power comes back in, or feels inconsistent when you ask for drive, a peak horsepower pull may miss the problem. You need a test method that exercises the transition or at least does not pretend a steady answer explains a transient symptom.

Transient testing also requires repeatability. The test sequence should model the service condition accurately, and then it should repeat precisely. If the first run uses one throttle reapplication profile and the next uses another, the comparison is polluted. This is the dyno equivalent of changing your line, brake release, and throttle timing while trying to compare tire pressures. You might still learn something, but you have made the signal harder to see.

Sub-skill 4: read turbo response as a compromise, not a peak number

Turbocharged engines make the peak-number trap especially attractive. A high stabilized number can look like a win while the car remains difficult to drive when demand changes quickly.

The control problem is straightforward. When a driver asks for more power suddenly, the engine needs fuel and air. In process-control terms, one wants air first on the way up and fuel first on the way down to avoid air deficiency. In a turbocharged engine, only special means such as variable turbine geometry can increase air flow in advance of fuel flow. That makes acceleration fuel control a compromise among performance, driveability, and exhaust cleanliness.

So when you test a turbo car, ask whether the engine behaves well when power demand changes. Does it recover cleanly after the sort of speed change and throttle reapplication the car sees on track? Does the calibration produce a better usable pull, or only a higher stabilized number? Does the sensor data support the conclusion, or is the dyno sheet hiding a lag, overshoot, or rough transition? The answer may still be that the higher peak setup is better. But it earns that answer by surviving the dynamic question.

Sub-skill 5: use sensors to explain the result

The dyno curve tells you what happened. The supporting channels help explain why.

The source material lists a practical set of inputs that can matter in engine-control work: barometric pressure, fuel pressure, oil pressure as a safety factor, coolant temperature, ambient air temperature, fuel temperature, exhaust gas temperatures by cylinder, lambda sensors, knock sensors, fuel-flow measurement, and even driveline torque. You do not need every channel for every test. You need the channels that answer your question.

If the engine loses power after several pulls, temperature channels may explain whether the comparison is fair. If one cylinder is running hot, exhaust gas temperature may keep you from calling a dangerous condition a tuning gain. If the engine wants timing in one part of the curve but knocks elsewhere, knock sensing changes the decision. If fuel temperature rises in a return-flow system, a later pull may not be equivalent to an earlier one. The point is not to drown yourself in data. The point is to keep the dyno graph from being the only witness.

Worked example: final race tuning on a chassis dyno

You have a moderately powerful production-based car and you are choosing final jetting and ignition settings before an event. The tempting move is to book dyno time, make a pull, and leave with the highest number from the day. That is not a test plan. It is a souvenir hunt.

A better plan starts with the baseline. Use the same chassis dyno you have used before, or accept that your comparison will be weaker. Try to keep temperature and humidity close to the baseline conditions. Drive the car onto the rollers, let the instrumentation be hooked up, and run into the rpm range you actually intend to use. The corpus example uses 6000 rpm as the tach point, but the broader lesson is not that every car should be judged at 6000. The lesson is to define the rpm range before the pull instead of chasing whatever point produces the biggest number.

Make the baseline pulls first. If they are not repeatable enough to trust, fix the procedure before changing the car. Once the baseline is credible, make the jetting or ignition change and repeat the same method. Now ask three questions. Did the output improve compared with the baseline? Did it improve in the rpm range you actually use? Did the supporting signals and engine behavior stay acceptable?

Suppose the new setting wins only at the very top but is weaker through the range you fall into after an upshift. You should not call that an automatic improvement just because the peak is bigger. Suppose it gives up a small top-end number but makes the pull cleaner and more repeatable through the range where you accelerate out of slower corners. That may be the better driving setup. The dyno did its job because it prevented you from discovering the answer by wasting track sessions.

Worked example: a turbo engine that looks strong but responds poorly

Now take a turbocharged engine. The dyno sheet shows a strong peak at stabilized wide-open throttle, but the driver reports that the car is not clean when power is reapplied after a shift or after a slow corner. If you only value the peak, you will miss the complaint.

The source material gives the mechanism. A sudden demand for more power is difficult because the engine needs air flow as well as fuel flow. In a turbocharged engine, air flow may not rise ahead of fuel flow unless special hardware or control strategies are available. The calibration has to balance performance, driveability, and exhaust cleanliness during acceleration.

Your test question should therefore be dynamic. You are not merely asking how high the final number is. You are asking how the engine behaves when demand rises. If the facility can perform a transient sequence that represents the event, use it. If it cannot, be honest about the limitation and do not pretend a steady pull proves the transition. Watch the relevant channels. A change that adds peak but worsens the response problem has not solved the driver's issue. A change that gives a slightly lower peak but improves the demand transition may make the car faster and easier to place on track.

Worked example: the limited-budget racer with an off-brand engine

The corpus describes a racer with fewer dollars or an off-brand engine who has to develop it on his own. That racer can learn faster on a dyno than by cutting, trying, and measuring lap times. This is a realistic club-racing situation. You may not have factory maps, a large test program, or endless track days. You still need disciplined evidence.

The wrong version of this story is scattered experimentation. Change a part, drive a session, look at lap times, change another part, and argue in the paddock about whether the engine feels better. Lap time contains too many variables: traffic, driver execution, tire state, weather, fuel load, and setup. Straightaway acceleration runs do provide real-world information, but they are not as precise as a controlled dyno comparison.

The better version is to use the dyno to narrow the problem before the track. Establish the baseline. Change one engine variable when possible. Repeat the pull. Record enough conditions that you can return later and compare honestly. Then take only the defensible changes to the event. You are not using the dyno because it is more glamorous than the track. You are using it because it makes the learning loop shorter and cleaner.

Calibration cues: how you know you are improving

You are improving when your dyno conversations become less about the big number and more about the evidence chain.

The first cue is repeatability. Baseline pulls agree closely enough that the operator, builder, and driver can trust the comparison. You do not need fake precision, but you do need enough consistency that a change can be distinguished from ordinary fluctuation.

The second cue is condition control. You can state the dyno used, the approximate temperature and humidity context, the operating method, the rpm range, and the instrumentation. If you cannot describe the test, you probably cannot defend the conclusion.

The third cue is range awareness. You stop asking only where the graph peaks and start asking where the car spends time when driven. The useful part of the pull is tied to the rpm range selected for the test and the way the engine is used after shifts and during acceleration.

The fourth cue is sensor literacy. You use temperature, pressure, lambda, knock, exhaust temperature, fuel, or torque channels when they are relevant. You also know when a sensor is too slow or delayed for the event you are trying to interpret.

The fifth cue is track readiness. A good dyno session reduces surprises at the event. It does not guarantee a lap time by itself, but it helps you avoid arriving at the track with jetting, ignition, or calibration settings that leave performance unused or behavior unexplained.

Common mistakes

The first mistake is crowning the single best pull. What it feels like is easy: everyone points at the highest number and the session becomes a celebration. What it costs is clarity. If that number came from a non-repeatable fluctuation, a different condition, or a sampling artifact, the car may not actually be better. Good looks like comparing repeatable before-and-after pulls under comparable conditions.

The second mistake is comparing different worlds. A pull from one dyno on one day is compared with a pull from another dyno or another weather condition as if the two were absolute truth. The source material is clear that the same dyno and approximately the same temperature and humidity matter when you want to evaluate modifications. Good looks like treating unmatched tests as weak evidence and using them cautiously.

The third mistake is ignoring the torque path. A chassis dyno is practical and useful, but tire slip on the rollers can limit horsepower measurement accuracy. If you forget that, you can diagnose an engine change when the measurement path is part of the issue. Good looks like using the chassis dyno as a controlled comparison tool and, when necessary, moving engine development to an engine dyno or a more direct setup.

The fourth mistake is using a steady test to answer a transient complaint. If the problem happens after a shift or during sudden throttle demand, a stabilized peak number may not answer it. Good looks like either running a transient sequence that models the service condition or admitting that the current test only partially addresses the complaint.

The fifth mistake is trusting slow data during fast events. A damped, stable exhaust analyser may be excellent for steady work and poor for millisecond-level transient diagnosis. Sampling point distance and instrument response can delay the signal. Good looks like matching instrument response to the question and being careful with timing claims.

The sixth mistake is fixing a curve problem with a crude global trick. Moving a distributor, crank sensor, throttle potentiometer, or other signal may create a broad change, but dyno evidence often points to a need for more complex advance or mixture curve changes. Good looks like using a control system and sensors that can shape the correction where it is needed.

The seventh mistake is using the track as the first test cell. Track feedback matters, and straightaway acceleration runs are real-world information. But if the engine change can be tested more precisely before the event, dyno work saves time. Good looks like arriving at the track with the main engine questions already narrowed.

Drill: baseline-change-repeat dyno discipline

Use this drill at your next dyno opportunity. It is designed for one focused tuning question, not a full engine-development program.

Before the session, write one test question on your notes sheet. Keep it specific: ignition setting, jetting change, calibration revision, or response complaint. Write the rpm range you care about and why. Bring the last useful baseline if you have one.

Step 1 is the baseline set. Make three baseline pulls using the same method. Record the dyno, approximate temperature and humidity, gear or operating method, rpm range, and relevant sensor channels. If the pulls do not repeat well enough to trust, stop the drill and fix the test method. Success for Step 1 is not a number. Success is a baseline you believe.

Step 2 is the single change. Make the intended change and repeat the same pull method. Do not change the question in the middle of the test. If safety channels show a problem, stop and treat that as the result. The dyno is allowed to tell you not to continue.

Step 3 is the pull review. Compare the baseline and changed sets across the rpm range you wrote down before the session. Mark three regions: better, unchanged, worse. Then review any relevant channels that explain the result. If the result is a turbo or gear-change response question, also mark whether the test actually exercised the transient event. If it did not, write partial evidence instead of solved.

The success criterion is a decision you can defend without pointing only at the highest peak. At the end, you should be able to say: under these conditions, on this rig, with this method, this change improved or did not improve the part of the pull we care about, and the supporting evidence either agrees or raises a concern. If you cannot say that, the drill did its job by preventing an overclaim.

When this principle breaks down

There are times when the honest answer is that the current test cannot support the conclusion.

If you have no baseline, you can still learn, but you should not speak as if you have measured an improvement. If the dyno or conditions changed, treat the comparison as provisional. If tire slip is present on the rollers, do not assign all variation to the engine. If the signal is slow or delayed, do not use it for precise transient timing. If the facility cannot model the gear-change or sudden-demand event you care about, do not let a steady peak answer a dynamic complaint.

This restraint is not weakness. It is good testing. A dyno session that tells you the evidence is incomplete is still valuable because it keeps you from making a confident wrong call.

Cross-references inside this module

This lesson sits between the other testing lessons in the module. Separate the test from the torque path when you are deciding whether the engine or the measurement path produced the result. Spec the rig around the unit under test when the question demands an engine dyno, chassis dyno, portable dyno, or transient-capable system. Control the test environment before reading the pull because comparison depends on the baseline. Turn test evidence into action only after the pull, the conditions, the sampling, and the supporting channels agree well enough to make the decision defensible.

The thread through all of them is the same. Do not tune from a trophy number. Tune from a controlled comparison that answers the driving question.

Worked example: final race tuning on a chassis dyno

Use the chassis dyno as a comparison tool, not a public horsepower court. For a production-based car, establish the baseline on the same dyno and under approximately similar temperature and humidity conditions, then test the jetting or ignition change in the rpm range you intended to use. If the new setting wins only at the top but is weaker through the range you fall into after an upshift, the higher peak has not automatically made the car better. The defensible result is the setting that improves the tested range and stays supported by the instrumentation.

Worked example: a turbo engine that looks strong but responds poorly

A turbo engine can show a strong stabilized peak while still behaving poorly when you suddenly demand power. The useful question is not only how high the final number is, but how the engine handles the transition after throttle reapplication or a gear change. Because air flow may not rise ahead of fuel flow without special means, acceleration control becomes a compromise among performance, driveability, and exhaust cleanliness. A test that cannot exercise that event should be marked partial evidence, not proof.

Common mistakes

The common failures are crowning the best single pull, comparing unmatched dyno days, ignoring tire slip and torque-path limits on a chassis dyno, using a steady test for a transient complaint, trusting slow exhaust data during fast events, and trying to fix a curve problem with a crude global sensor trick. Good testing looks calmer: same baseline, same method, comparable conditions, appropriate sensor speed, and a conclusion tied to the rpm range and behavior the car actually uses.

Drill: baseline-change-repeat dyno discipline

At the next dyno session, choose one tuning question before the first pull. Make three baseline pulls, record the dyno, approximate conditions, method, rpm range, and relevant channels, and stop if the baseline will not repeat well enough to trust. Make one intended change and repeat the method. Your success criterion is a decision you can defend without pointing only at the peak: under these conditions, on this rig, with this method, the change improved or did not improve the part of the pull you care about, and the supporting channels either agree or raise a concern.

When this principle breaks down

The honest conclusion may be that the current test cannot answer the question. No baseline, changed conditions, roller slip, slow or delayed instrumentation, and a steady pull used for a gear-change or sudden-demand complaint all weaken the claim. Treat those cases as incomplete evidence rather than forcing a confident tuning decision from a graph that does not support it.

Cross-references

This lesson connects directly to the sibling lessons on separating engine results from the torque path, controlling the test environment before reading the pull, selecting the rig around the unit under test, and turning test evidence into action. The shared principle is that a dyno result becomes useful only when the test method, environment, measurement path, and decision all line up.

Author Review

No quiz questions are attached to this lesson.

Sources

#DocumentChunkPagesScoreCollection
1Driving in competition None Johnson Alan 1935- None0653e176-dd04-4e90-fab7-e6a03591cd861301uio_books_raw_v1
2Engine Testing Theory and Practice Plint Martyr9c78cd2c-ac5c-e4af-1a87-d6d99d7560ad1711uio_books_raw_v1
3Engine Testing Theory and Practice (Plint, Martyr)bdbe746b170f63109bfeae89aa3685152571uio_books_raw_v1
4Engine Testing Theory and Practice Plint Martyrcbc4e3a8-ed04-4ba2-f0bc-f634f845610f2581uio_books_raw_v1
5Engine Testing Theory and Practice Plint Martyr73930687-1f14-3c7e-f490-df838ff52eb23561uio_books_raw_v1
6Race Car Engineering Mechanics Paul Van Valkenburgh77f1653e-c36e-2213-ee28-1f4fda2e2ca31561uio_books_raw_v1
7Race Car Engineering Mechanics Paul Van Valkenburgh7e409a9a-ebe1-2165-801a-1254a8d535351261uio_books_raw_v1