Why Law Firms Should Wait on Copilot Until Q3 2025
Microsoft is testing DeepSeek as an enterprise AI option inside Copilot, and the pricing implications are significant for law firms currently evaluating annual seat commitments. If the reports hold, firms could see per-seat costs drop from $30 per month to somewhere between $6 and $12 by late Q3 2025, with usage-based pricing replacing the flat subscription model.
For a 15-attorney firm paying $5,400 annually for Copilot seats, that’s a potential saving of $3,200 to $4,300 per year. For a 40-attorney practice, the difference is $8,600 to $11,500. The math gets interesting when you’re renewing now or considering a firm-wide rollout.
The question isn’t whether to use AI. It’s whether to lock in today’s pricing or wait 90 days to see if Microsoft ships a materially cheaper option that does the same work.
What Microsoft is actually testing
DeepSeek is a Chinese AI model that performs comparably to GPT-4 on most business tasks but costs a fraction to run. Microsoft is reportedly evaluating it as a second engine inside Copilot, giving enterprise customers a choice between the existing OpenAI backend and a lower-cost alternative for routine work.
The shift matters because most legal AI use cases don’t need the full horsepower of GPT-4. Summarising a discovery batch, drafting a conflict-check memo, or triaging an intake form are high-volume, low-complexity tasks. They’re also the tasks that eat up associate time and create the 4 to 6 hours per week of unbilled work that never makes it onto a client invoice.
If DeepSeek lands in Copilot with usage-based pricing, firms will pay only for the queries they run, not for seats that sit idle half the month. That’s a fundamentally different cost structure, and it favours smaller practices that don’t need 24/7 AI access for every attorney.
The timeline is still fluid. Microsoft hasn’t announced a release date, and the Yahoo report cites internal testing without a firm commitment. But the pattern is clear: enterprise buyers are pushing back on flat-rate SaaS pricing, and vendors are responding with tiered, usage-sensitive models. We’ve seen this play out in analytics, cloud compute, and now AI tooling.
For law firms, the practical takeaway is simple. If you’re three months from renewal, wait. If you’re evaluating a firm-wide Copilot rollout, delay until Q3 and see what pricing lands. The opportunity cost of waiting is low. The cost of locking in a 12-month contract at double the eventual rate is not.
Where law firms actually leak money on AI work
The Copilot conversation tends to focus on productivity gains, but the real cost in a legal practice isn’t speed. It’s leakage. Time spent on work that doesn’t bill, clients who call after hours and never get a callback, and junior associates reviewing documents at $300 per hour when the task could be automated for $15.
Billable-hour leakage is the silent killer. Every attorney in a small to mid-sized firm loses 4 to 6 hours per week on admin, intake triage, and document prep that never makes it onto an invoice. That’s 200 to 300 hours per year, per attorney. At a blended rate of $250 per hour, a 10-attorney firm is leaving $500,000 to $750,000 on the table annually.
Intake delays are worse. A high-intent caller reaches your firm at 6:15 p.m. on a Tuesday. No one answers. They leave a voicemail. By the time someone listens to it the next morning, the caller has already booked a consultation with a competitor. We see this pattern in 30 to 40 percent of after-hours intake across professional services. For law firms, where the average new-client lifetime value ranges from $8,000 to $50,000 depending on practice area, that’s a direct revenue hit.
Document review is the third pressure point. A junior associate spends two days on first-pass contract review for a commercial dispute. The client gets billed $4,800 for 16 hours of work. The associate hates it. The partner wishes they could bill more but knows the client will push back. And the work itself is repetitive, rules-based, and a perfect fit for an AI agent that can do the same review in 90 minutes for a fraction of the cost.
These aren’t Copilot problems. Copilot is a co-pilot, it still requires a human in the seat. The work I’m describing doesn’t need a human in the loop. It needs an agent that runs autonomously, handles the task end to end, and surfaces a result that a human can review in five minutes instead of five hours.
That’s the difference between a tool and a system. Tools make your people faster. Systems replace the work entirely.
What an agent-first approach looks like in a law firm
We build three types of agents for legal practices, and they map directly to the three leakage points above. The first is an Intake Voice Agent that answers every inbound call, conflict-checks the caller, captures the matter details, and books a consultation into the firm’s calendar. It runs on Omni voice, handles after-hours and weekend calls, and converts 60 to 75 percent of callers who would otherwise hang up and move on.
The second is a Matter Triage Agent that monitors the firm’s intake email and web forms, classifies each submission by practice area, scores the lead based on matter value and fit, and routes it to the right partner with a one-paragraph brief attached. It runs on Omni ops, eliminates the morning triage meeting, and cuts intake-to-response time from hours to minutes.
The third is a Document Review Agent that performs first-pass review on contracts, discovery batches, and matter files. It flags clauses, summarises positions, produces an associate-grade memo, and hands the file back to a human for final review. It also runs on Omni ops, and it turns a two-day associate task into a 90-minute agent task plus 20 minutes of partner review.
None of these agents require Copilot. They don’t require a human to prompt them, guide them, or sit next to them while they work. They’re autonomous, they run on a schedule or a trigger, and they produce a finished output that a human validates and ships.
The cost structure is also different. Copilot charges per seat per month, whether you use it or not. Omni charges based on agent activity, so you pay for the work that actually happens. For a 12-attorney firm running an Intake Voice Agent and a Matter Triage Agent, the monthly cost typically falls between $800 and $1,400, depending on call volume and intake load. That’s less than five Copilot seats, and it’s doing work that would otherwise require a full-time intake coordinator plus 10 hours per week of associate time.
The ROI isn’t theoretical. One litigation practice in our network added an Intake Voice Agent in January and saw after-hours conversion jump from 18 percent to 68 percent in the first 60 days. That’s 14 additional consultations per month, and six of those turned into retained clients. At an average case value of $12,000, the agent paid for itself in week three.
Another commercial firm deployed a Document Review Agent for contract redlining and cut first-pass review time by 72 percent. The associate who used to spend 12 hours per week on contract review now spends three hours validating agent output and focusing on the clauses that actually matter. The client still gets billed for associate time, but the firm’s internal cost dropped by $180 per contract, and the partner can now take on two additional clients per quarter without hiring.
If you want a structured way to evaluate where an agent fits into your intake process, we built a worksheet that walks through the decision points. You can grab the AI Client Intake Checklist for Law Firms and use it to map your current intake flow against the agent model. It’s a 20-minute exercise, and it’ll show you exactly where the leakage sits.
Why the Copilot pricing shift matters even if you don’t use Copilot
The broader point is that enterprise AI pricing is moving from seat-based to usage-based, and that shift changes the economics for every professional services firm evaluating AI tooling. Seat-based pricing made sense when software was a static tool that sat on a desktop. Usage-based pricing makes sense when the software is doing work autonomously and the value is in the output, not the access.
Law firms have been conditioned to think about software as a per-user cost. Practice management, document storage, research databases, all priced per attorney. But agents don’t map to that model. An Intake Voice Agent doesn’t need a seat. It needs call volume. A Document Review Agent doesn’t need a login. It needs documents to process.
If Microsoft ships DeepSeek inside Copilot with usage-based pricing, it’ll validate what we’ve been saying for 18 months. The future of legal AI isn’t a smarter Word plugin. It’s autonomous agents that do the work your associates don’t want to do, at a cost that makes sense for a 10-attorney firm, not just a 200-attorney firm.
And if you’re a partner evaluating AI spend right now, the decision tree is straightforward. If Copilot is already deployed and your team is using it daily, renew. If you’re evaluating a new rollout or coming up on renewal in Q2 or Q3, wait. The pricing landscape is shifting, and the cost difference is material enough to justify a 90-day delay.
In the meantime, focus on the work that Copilot can’t do. Intake. Triage. First-pass review. These are the tasks that leak hours and revenue, and they’re the tasks that agents handle better than co-pilots because they don’t require a human in the loop.
What an Omni Audit looks like for a law firm
We run a 60-minute diagnostic for legal practices that want to see where agents fit without committing to a six-month implementation. It’s called an Omni Audit, and it’s built for partners who don’t have time for a deck and a discovery process. You can book a 60-min Omni Audit directly, and we’ll walk through your intake flow, document pipeline, and matter triage process in one call.
The output is three things. First, a leakage map that shows where unbilled hours and lost clients are sitting in your current process. Second, a one-page agent spec that describes exactly what an Intake Voice Agent or Matter Triage Agent would do in your firm, with cost and ROI ranges. Third, a 90-day implementation plan that breaks the build into phases so you’re not deploying everything at once.
We don’t pitch a platform. We don’t sell seats. We build the agents your firm actually needs, and we charge based on the work they do. If an Intake Voice Agent handles 60 calls per month, you pay for 60 calls. If a Document Review Agent processes 15 contracts, you pay for 15 contracts. The pricing scales with your caseload, not your headcount.
For most firms in the $1M to $25M range, the audit identifies $80,000 to $250,000 in annual leakage, and the agent build recovers 40 to 60 percent of that in the first year. The payback period is typically 90 to 120 days, and the ongoing cost is less than a mid-level associate’s salary.
If you want to see how the AI audit for law firms works in practice, the page walks through the process and shows sample outputs from firms we’ve worked with. It’s not a lead form. It’s a breakdown of what happens in the 60 minutes and what you walk away with.
The Copilot question is a timing question. The agent question is a revenue question. One is about whether to wait 90 days for better pricing. The other is about whether you’re leaving $150,000 per year on the table because your intake process still relies on voicemail and your associates are still doing first-pass contract review by hand.
You can solve both, but the second one doesn’t require you to wait for Microsoft.
What to do in the next 30 days
If you’re renewing Copilot in Q2 or Q3, push the decision to late Q3 and see what pricing Microsoft ships. If you’re evaluating a firm-wide rollout, delay until the DeepSeek option is live or confirmed dead. The opportunity cost of waiting is low, and the potential saving is 60 to 80 percent of your annual AI spend.
If you’re not using Copilot but you’re losing 4 to 6 hours per attorney per week on unbilled work, start with intake. An Intake Voice Agent is the highest-ROI deployment for most small to mid-sized practices, and it’s the easiest to validate because the conversion lift shows up in your CRM within 30 days.
If you’re already handling intake well but your associates are drowning in document review, deploy a Document Review Agent for one practice area and measure the time savings over 60 days. If it works, expand it. If it doesn’t, you’ve spent $1,200 and learned something useful.
And if you want to see where the leakage sits in your firm without committing to a build, book my Omni Audit and we’ll map it in one call. You’ll walk away with a leakage number, an agent spec, and a 90-day plan. No deck, no follow-up meeting, no multi-month discovery process.
The AI pricing landscape is shifting, and law firms that wait for clarity on Copilot while ignoring the work that agents can do today are solving the wrong problem. Copilot makes your people faster. Agents replace the work that shouldn’t require people in the first place.
You can read more about how we approach AI implementation for professional services firms in our insights library, or explore the Omni platform to see how voice and ops agents work together. The tools are ready. The pricing is moving in your favour. The only question is whether you’re going to wait for Microsoft or start recovering the $150,000 you’re leaving on the table this year.