Introduction The benefits of custom software engineering services depend on a distinction that is rarely made explicit at proposal time: is the problem an engineering task or a research question? Engineering tasks have known methods, predictable data, bounded uncertainty — a custom-software engineering team delivers reliably against a schedule. Research questions have open novelty, unbounded data quality concerns, no reliable baseline — schedules and deliverables look different. This article frames the distinction, explains how to recognise each, and explains why the wrong classification consumes budget without producing outcomes. See the services landing for the broader programme. The corrected approach is classify-then-scope: identify whether the work is engineering or research first, then choose contracting model, schedule, and deliverables accordingly. What this means in practice Engineering tasks have known methods; research questions have open methods. Misclassifying research as engineering produces missed schedules and lost budget. Different contract models (fixed-price vs time-and-materials) fit different classes. The distinction also applies to GenAI feasibility (per-use-case). How do I tell whether an AI problem is an engineering task or a research question? The diagnostic questions: Is there a known method? An engineering task has a method known to work for problems of this type — published, demonstrated, sometimes in production elsewhere. A research question has no known reliable method; methods are candidates to test. Can a similar system be pointed to? Engineering task: yes, similar systems exist. Research question: closest existing systems are demonstrably insufficient. Are the input properties bounded? Engineering: input properties (format, quality, volume, distribution) known in advance and bounded. Research: input properties unknown, unstable, or unbounded. What is the failure profile? Engineering: failures are operational, fixable, recoverable. Research: failures are existential — the method might just not work. Are evaluation criteria well-defined? Engineering: clear metrics, target thresholds, acceptance criteria. Research: evaluation itself may be an open question. Is there an existing baseline? Engineering: established baseline against which improvements are measured. Research: no baseline, or the baseline is the open question. The signal pattern. An engineering task has multiple “yes” answers above. A research question has multiple “unknown” or “no” answers. Some problems are mixed: parts are engineering, parts are research. The mixed case needs decomposition. The discipline. Run this diagnostic at proposal time. Be honest. The unwillingness to classify something as research often comes from contracting pressure, not from genuine certainty. Which signals (known method, predictable data, bounded uncertainty) classify a problem as engineering? The engineering signals: Known method published with positive results. Published papers, open-source implementations, vendor-shipped products. The method has been applied to similar problems with documented success. Existing production systems with similar capability. Competitors, partners, or adjacent products demonstrate the capability is feasible. Predictable input data. The input properties (resolution, latency, distribution, language) are known in advance and stable. The team can construct representative training/evaluation data. Bounded uncertainty. The team can estimate the achievable accuracy range from prior work. The uncertainty interval is reasonable for the use case. Clear acceptance criteria. The customer can describe what “good enough” means in terms the engineering team can measure. Established team expertise. The team has done similar work; the expertise gap is bounded; learning curve is predictable. Available standard tooling. The technology stack is mature; the team is not building tools to do the work. Decomposable work. The problem can be broken into components with predictable schedules; risks are at the component level. Result. The work fits a project (fixed scope, schedule, budget) with engineering-grade confidence intervals. Fixed-price contracts can be reasonable; time-and-materials with capped scope works. Which signals (open novelty, unbounded data quality, no reliable baseline) classify a problem as research? The research signals: Method is open. The team is not sure which method will work; the work involves testing candidate methods and learning which works (or doesn’t) in this context. Data quality is unknown or unbounded. The data the system will see is highly variable, contains failures the team hasn’t characterised, may require collection or labelling investment of unknown scope. No reliable baseline. There’s no prior system performing this task at acceptable level; the team is exploring whether one is possible. Performance ceiling is unknown. The team can’t bound what accuracy/performance/latency is achievable; the question itself is open. Failure modes are unknown. Until the team has built and tested several candidates, the failure modes are not predictable. Evaluation may be open. How to measure success may itself be a research question; standard benchmarks may not apply. Expertise gap. The work requires expertise the team is acquiring, often through the project itself. Stakeholder expectations are misaligned. The stakeholder may expect engineering outcomes for what is actually research; this misalignment is itself a signal. Result. The work fits a research programme (open-ended schedule, exploratory budget, optional pivot). Fixed-price contracts are inappropriate; time-and-materials with regular evaluation gates and pivot options is appropriate. How is project scope, schedule, and budget framed differently when the work is research rather than engineering? Engineering project framing: Scope. Specific deliverables defined: API specifications, performance thresholds, integration requirements, documentation, training. The work has a definition of done. Schedule. Time-bounded with milestones. Risk buffer included for execution risk, not for method risk. Budget. Estimated based on team size, schedule, infrastructure. Variance bounded. Contract model. Fixed-price with risk buffer; or T&M with capped scope. Acceptance criteria explicit. Research project framing: Scope. Investigation focused on specific question; deliverables include research artefacts (experiments, evaluations, recommendation) rather than only production code. The work has a definition of done at the question level. Schedule. Time-bounded for the investigation (e.g., 3-month research project); not for the final-product delivery (which depends on what the research finds). Budget. Allocated for investigation; production-system budget allocated separately, conditional on research outcome. Contract model. T&M with regular review gates. At each gate, decision to continue, pivot, or stop. Production-system contract follows separately. Mixed-mode framing: When the work has both research and engineering components, split: Research phase. Time-bounded investigation with research deliverables (proof-of-concept, evaluation report, recommendation). Engineering phase. Production build conditional on research outcome; separate scope, schedule, budget. Decision gate. Between phases, customer decides whether to proceed based on research findings. The contracting honesty. Refuse to sign a fixed-price engineering contract for research-class work. Refuse to deliver research outcomes within an engineering schedule. Setting expectations correctly is the customer-trust foundation. Why do projects framed as engineering when they were actually research consume budget without producing outcomes? The failure pattern: Budget consumed exploring methods. The team tries one approach, doesn’t work, tries another, doesn’t work, tries a third. Each attempt consumes engineering hours. The aggregate is research, but the budget was engineering. Schedule slips. Each method-attempt overruns the engineering schedule. The team is repeatedly behind. Stakeholders escalate. Quality is sacrificed. Under schedule pressure, the team accepts methods that almost work; production quality is below engineering threshold; customer is disappointed but stuck. Stakeholder trust degrades. The team appears to be failing; in reality, the work was misclassified. The trust loss is permanent. Scope is reduced under pressure. The customer accepts reduced scope to land within budget; the reduced-scope system may not actually solve the original problem. Pivot is delayed or never happens. Because the contract framed engineering, pivot conversations are framed as failure. The team continues trying the original framing instead of acknowledging the research nature. The team carries the cost. Custom-software engineering teams that misclassify regularly end up subsidising research with their own budget. Margins erode; in extreme cases, the team is unprofitable. The customer is also harmed. The customer paid for an engineering outcome; received less; trust eroded. The economic value of the work to the customer is reduced even if the contract was technically delivered. The root cause. The proposal stage didn’t separate engineering from research. Often this is because: the customer doesn’t know the difference; the engineering team is incentivised to win the contract; the actual problem novelty was discovered only after start. The remedy. Diagnose at proposal time. If uncertain, propose a short scoping engagement before any commitment. The scoping engagement determines classification, then the main engagement is framed correctly. How does the engineering-vs-research distinction relate to per-use-case GenAI feasibility (TK3-CCU-04)? The relationship: Per-use-case GenAI feasibility is a specific application of the engineering-vs-research distinction. For a specific GenAI use case, the question “is this engineering or research?” has a specific form: “is this GenAI deployment a method known to work for this kind of problem?” GenAI engineering signals: The use case has been demonstrated by similar deployments. Documented evidence: case studies, vendor demonstrations, peer companies. The model behaviour is predictable enough for the application. Hallucination rate, accuracy, latency known and acceptable. The integration is standard. Standard patterns (RAG, function-calling, prompt-engineering) suffice. GenAI research signals: The use case is novel; no published evidence of similar success. The model behaviour is unpredictable for the application. Hallucination is a problem; accuracy is variable; mitigation strategies are not known to work. The integration requires invention. Novel multi-step reasoning, novel agentic patterns, novel evaluation. The team is inventing. The mapping. Engineering GenAI cases: chatbots over known content, document summarisation, structured-data generation, well-evaluated copilots. Research GenAI cases: agentic systems over open environments, novel reasoning capabilities, complex multi-step workflows without baseline, evaluation methods that don’t yet exist. The cross-reference (TK3-CCU-04). GenAI feasibility per-use-case is the engineering-vs-research test applied to a specific GenAI deployment. The same diagnostic questions apply; the same contracting consequences apply. If GenAI feasibility is genuinely uncertain, treat as research, not engineering. The 2026 reality. Many GenAI deployments are being contracted as engineering when they’re actually research; the failure pattern of the previous question is playing out at scale across the GenAI market. The discipline of classifying first is more important in 2026 than ever before. The strategic implication. Custom software engineering teams that maintain this discipline ship more reliably, retain customer trust, and avoid the margin erosion that comes from chronically misclassifying. Teams that ignore the distinction get caught in the failure loop. How TechnoLynx Can Help TechnoLynx works with customers on scoping engagements that classify work as engineering or research before contracting, and on engineering and research deliveries that match the classification. We focus on getting the framing right before the work starts. If your team is scoping AI work and the classification is unclear, contact us. Image credits: Freepik