Skip to main content
Contents
Back to sourcing guides

Vendor Scorecards: How to Measure Supplier Performance

A practical guide for small businesses on creating and using vendor scorecards to measure quality, delivery, and cost performance.

When a company is small, vendor management happens by intuition. You know Bob at the machine shop is reliable, and you know the steel distributor is always late on Tuesdays. But as you scale—adding more vendors, more parts, and more buyers—intuition breaks down.

“Gut feeling” management leads to hidden costs: the premium freight you paid because a vendor was late (again), the rework hours spent fixing “minor” defects, and the gradual price creep that goes unnoticed. To scale effectively, you need an objective way to measure who your best partners actually are.

The 5 Pillars of Supplier Performance

A scorecard doesn’t need to be complex, but it should cover more than the basics. For small-to-mid-sized manufacturers, these five metrics form a complete picture of supplier health.

1. On-Time Delivery (OTD)

The Metric: (Number of orders received on or before the promised date) / (Total number of orders).

Why it matters: Late deliveries disrupt your production schedule and force you to carry excess “safety stock” inventory, tying up cash.

Measurement nuances: Define “on time” precisely. Is it based on the original promised date or the most recently confirmed date? If a supplier pushes their delivery date out twice and then hits the revised date, is that “on time”? Most rigorous programs measure against the original commitment — otherwise, a supplier who consistently pushes dates can still score 95% OTD.

OTD LevelWhat It MeansBusiness Impact
98-100%World-class reliabilityMinimal safety stock needed, predictable scheduling
95-97%Strong performerOccasional buffer needed, manageable disruption
90-94%AverageRegular safety stock required, some schedule adjustments
80-89%ProblematicFrequent expediting, premium freight costs, customer impact
<80%UnacceptableProduction stoppages, revenue at risk, immediate action required

2. Quality (Defect Rate)

The Metric: (Number of rejected or nonconforming parts) / (Total parts received).

Why it matters: The “cheapest” vendor is often the most expensive if 5% of their parts require rework. Track both total rejections and “concessions” (parts you accepted but had to fix).

The hidden cost of defects: A 2% defect rate sounds minor until you calculate the total cost of quality (TCOQ):

  • Sorting cost: Labor to inspect incoming material ($25-$50/hour)
  • Rework cost: Labor and machine time to fix nonconforming parts
  • Scrap cost: Material that cannot be recovered
  • Return shipping: Freight to send defective parts back
  • Line stoppage cost: If defects reach production before detection ($500-$5,000+ per hour)
  • Customer impact: Late shipments, warranty claims, reputation damage

A supplier quoting $10/part with a 3% defect rate may cost you $11.50/part when quality costs are loaded in. A competitor quoting $10.80 with a 0.5% defect rate is actually cheaper.

3. Cost Competitiveness

The Metric: Price variance against the market average, against your should-cost model, or against last purchase price.

Why it matters: Are they raising prices faster than the market? A good partner works with you to keep costs flat; a transactional vendor creeps them up every quarter.

Three ways to measure cost performance:

MethodBest ForLimitation
Year-over-year price changeTracking price creep on repeat partsDoes not account for legitimate market changes
Market index comparisonCommodity-heavy purchases (steel, aluminum, plastics)Requires access to index data and a mechanism to map to your parts
Should-cost varianceCustom fabrications, sole-source partsRequires a should-cost model (see our should-cost modeling guide)

4. Responsiveness

The Metric: Average time to respond to RFQs, engineering change requests, quality inquiries, and escalations.

Why it matters: A supplier who takes three weeks to return a quote on a simple part is signaling one of two things: they are overwhelmed (capacity risk) or you are not a priority (relationship risk). Either way, slow responsiveness becomes your bottleneck.

Benchmark ranges:

Request TypeGood Response TimeAcceptableNeeds Improvement
Standard RFQ (repeat part)1-2 business days3-5 business days>5 business days
New part RFQ3-5 business days5-10 business days>10 business days
Engineering change request2-3 business days5-7 business days>7 business days
Quality issue escalationSame day1 business day>2 business days

5. Documentation and Compliance

The Metric: Percentage of shipments received with complete, accurate documentation (certs, packing slips, inspection reports).

Why it matters: Missing documentation creates downstream delays — your quality team cannot release parts to production without material certs, your accounting cannot process payment without matching packing slips, and your customer may reject shipments without proper traceability records. In regulated industries (aerospace, defense, medical, nuclear), documentation failures can halt an entire project.

Building a Weighted Scorecard

Not every metric matters equally for every supplier. A raw material distributor should be weighted heavily on delivery and cost; a precision machine shop should be weighted on quality and responsiveness. The weighting reflects your priorities for that specific supplier category.

Example Weighting by Supplier Type

MetricRaw Material DistributorCNC Machine ShopAssembly / Sub-ContractSpecialty Process (Heat Treat, Plating)
On-Time Delivery30%25%30%25%
Quality15%35%30%35%
Cost Competitiveness30%20%15%15%
Responsiveness15%15%15%15%
Documentation10%5%10%10%

Scoring Scale

Use a consistent 1-5 scale across all metrics:

ScoreDescriptionAction Required
5 (98-100%)Strategic PartnerIntegrate deeper (e.g., VMI, co-development).
4 (95-97%)Preferred SupplierMaintain, monitor, consider volume increase.
3 (90-94%)StandardReview quarterly, identify improvement areas.
2 (80-89%)ProbationaryCorrective Action Plan required within 30 days.
1 (<80%)UnsatisfactoryResource immediately, begin qualification of replacement.

Calculating the Composite Score

Multiply each metric score by its weight and sum the results.

Example — CNC Machine Shop:

MetricWeightScoreWeighted Score
On-Time Delivery25%4 (96%)1.00
Quality35%5 (99.5%)1.75
Cost Competitiveness20%3 (at market)0.60
Responsiveness15%4 (3-day avg quote turnaround)0.60
Documentation5%3 (occasional missing certs)0.15
Composite Score4.10 — Preferred Supplier

This supplier is strong on quality and delivery but average on cost and documentation. The scorecard tells you exactly where to focus the next quarterly business review.

The Feedback Loop: How to Share Scores

The data is useless if you don’t share it. The goal of a scorecard isn’t to punish vendors; it’s to align expectations.

For High Performers: Send the scorecard with a “Thank You.” Acknowledging their performance builds loyalty. “You’re at 99% OTD this quarter. We appreciate the reliability and will be looking to increase our volume with you next year.”

For Underperformers: Use data, not emotion. “We noticed your OTD dropped to 82% this quarter. This caused two line stoppages. Can you help us understand the root cause and your plan to get back to >95%?” Most professional vendors will appreciate the clarity and work to fix it to protect the account.

Structuring the Quarterly Business Review (QBR)

For your top 10-15 suppliers (by spend or criticality), conduct a quarterly business review. Keep it focused:

Agenda (30-45 minutes):

  1. Scorecard review (10 min): Present the data. Let the numbers speak. Avoid subjective commentary — the scorecard is the conversation, not your feelings about the supplier.
  2. Root cause discussion (10 min): For any metric below target, discuss what happened and why. Was it a one-time event or a systemic issue?
  3. Corrective actions / improvement plan (10 min): If corrective action is needed, agree on specific actions, owners, and deadlines. Document them.
  4. Forward look (10 min): Share your upcoming demand forecast, any specification changes, and new projects they should be quoting. Giving suppliers visibility into your pipeline is one of the most valuable things you can do — and it costs nothing.

Handling the Difficult Conversation

When a supplier scores below acceptable thresholds, the conversation needs to escalate without becoming adversarial:

Level 1 — Verbal discussion at QBR: “Your OTD has been trending down for two quarters. What’s driving this?” Often, the supplier is aware and working on it. Sometimes, they are not aware because their own data differs from yours (their system says they shipped on time; your dock log says they arrived late).

Level 2 — Formal Corrective Action Request (CAR): A written document that identifies the problem, requests a root cause analysis, and requires a corrective action plan with dates. This is standard practice in manufacturing and should not be treated as adversarial — it is a professional mechanism for problem resolution.

Level 3 — Probation: Reduce volume allocation and begin qualifying an alternative. Communicate this clearly: “We are moving 30% of our volume to a backup source while we work through these quality issues. We want to continue the relationship, but we need to see sustained improvement before restoring full volume.”

Level 4 — Transition: If performance does not improve after a defined probation period (typically 2-3 quarters), begin a full transition to the replacement supplier. Ensure all tooling, documentation, and IP are transferred.

Avoiding Common Scorecard Mistakes

Measuring too many things. A scorecard with 15 metrics is a spreadsheet exercise, not a management tool. Five metrics, weighted by importance, tell you everything you need to know.

Measuring only what is easy. OTD and defect rate are easy to measure because the data is transactional. Responsiveness and documentation compliance require more effort to track but often reveal the most about a supplier’s commitment to the relationship.

Not weighting metrics. An unweighted scorecard treats a 2% cost increase the same as a 2% defect rate increase. These are not equivalent. Quality failures create cascading costs that far exceed a modest price delta.

Scoring too infrequently. Annual scorecards are post-mortems. By the time you identify a trend, you have lost two quarters of corrective action time. Quarterly is the minimum cadence for top suppliers; monthly is appropriate during new supplier ramp-ups or corrective action periods.

Not tying scores to consequences. A scorecard that generates a report but drives no action is a waste of everyone’s time. Scores must connect to volume allocation decisions, contract renewals, and preferred supplier status. Otherwise, the exercise is decorative.

Using Scorecard Data for Sourcing Decisions

Over time, your scorecard database becomes one of the most valuable assets in your procurement operation. It enables:

Evidence-based supplier selection. When awarding a new part, historical performance data supplements quote comparison. A supplier quoting 5% above the lowest bid but scoring consistently at 98%+ OTD and <0.5% defect rate is often the better total-cost choice.

Supplier rationalization. Scorecard data identifies which suppliers in a category are worth investing in and which should be phased out. If you have four machine shops and one consistently scores 2.0 while the others score 4.0+, the rationalization decision is clear.

Negotiation leverage. “Based on our data, your quality performance has been excellent — 99.6% acceptance rate over 8 quarters. We’d like to expand the relationship. Here is a new part family we are sourcing, and we’d like to offer you first right of refusal.” Performance data makes volume commitments credible and specific.

Trend detection. A supplier whose score drops from 4.5 to 3.8 over three quarters is sending a signal — even if no single metric has crossed a critical threshold. Trend analysis catches problems before they become crises.

Summary

A vendor scorecard is your GPS for the supply chain. It tells you exactly who is driving your business forward and who is holding it back, allowing you to make sourcing decisions based on fact, not fiction. Start simple — five metrics, weighted by importance, reviewed quarterly with your top suppliers. The conversation it creates is as valuable as the score itself.

Procurement intelligence for complex sourcing

Purchaser normalizes vendor quotes into structured, defensible sourcing data — automatically, from intake to award.

Quantify the case for change

Put numbers on the time and risk savings from replacing manual procurement workflows with structured automation.

See Purchaser on your data

In a short working session, we'll map your current workflow and show how Purchaser handles your vendor data.

  • How Purchaser ingests vendor submissions from email in any format
  • How scope deviations and assumptions are surfaced automatically
  • What structured bid comparison looks like on your data