Formalizing Materials R&Dmodular reference
Home/Materials reference/Orientation & the Rosetta stone
Materials reference · Orientation

Orientation & the Rosetta stone

How to read this reference as an engineer, the materials↔EE/CS Rosetta stone, the P-/RD-code legend, and the concept→problem index.

~14 min read

This is a from-scratch materials-science reference for engineers — electrical engineering, robotics, computer science — with no chemistry background. Its job is to make the domain behind the ARiSE proposal legible in your language: every materials term is defined on first use and carried by an explicit analogy to signals, systems, control, state machines, sensors, or linear algebra. The companion math reference teaches the algebra; this one teaches the physical world that algebra is about.

You do not have to read this linearly. Skim this front door, lock in the Rosetta stone below, then jump to whatever module you need via the concept index.

How to use this reference

Read this with one thesis in the back of your mind, because every module is a refinement of it:

The whole thing in one sentence. Industrial “dirty” materials — above all paints and coatings, which are multi-component dispersed systems (solids suspended in a liquid) — are hard because composition does not determine performance: the same ingredients can yield very different products, because performance depends on the process history (mixing order, shear and thermal history) acting through a hidden microstructural state, and many decisive steps (drying, curing, aggregation) are irreversible.

That single fact spawns the rest: requirements are multi-objective and interfering, much decisive know-how is tacit (never written down), and every measurement is partial and noisy. The ARiSE draft proposes to make all of this AI-ready; the synthesis formalizes why it is hard with category theory; this reference gives you the physics underneath both.

This reference is one of a set of companion documents:

  • The synthesis (algebra-ch0_formalizing-materials-RnD_synthesis.md) states the thesis — that materials R&D is best modelled with categories and algebra. It is the why-it-is-hard. This reference is the physical what.
  • The math reference (a parallel modular site) is the formal toolkit. When a phenomenon here wants an algebraic home — a hidden state, a non-commuting process, an aliasing sensor — that reference supplies it.
  • The ARiSE draft is the original problem-and-AI-solution document the whole effort formalizes.

Throughout, small chips tie the physics back to the synthesis. A P# chip names one of thirteen problem characteristics — the first-principles decomposition of what makes the domain hard. An RD# chip names one of eight research directions — the programmes the synthesis proposes. Both legends are spelled out in full below so this reference stands alone. Wherever you see a synthesis callout, it names exactly which P#/RD# the surrounding physics addresses.

Bridge. This is the materials half of the pair. For the algebra that formalizes everything here — why a hidden state is a quotient, why processing is a premonoidal category, why colour aliasing is a non-injective map — start at the math orientation page, which carries the same P#/RD# legend and its own concept index.

The Rosetta stone

This single table is the fastest on-ramp: the materials-science world translated, row by row, into EE / control / robotics / CS. Each row is unpacked, with its caveats, in the module the concept index points to. If you read nothing else on this page, read this.

Materials-science concept Analogue in EE / control / robotics / CS
A “dirty” dispersed material (paint, slurry) a metastable many-body system held in an engineered local energy minimum
Microstructure the hidden / latent state — the HMM state, the state vector a Kalman observer must estimate
Composition (the recipe / formulation) a parameter vector / source code — necessary, not sufficient to fix the output
Processing (mix, mill, coat, dry, cure) a pipeline of stateful, side-effecting operations on one mutable object
Mixing order matters non-commuting operations / order-dependent writes to shared state
Shear / thermal history; thixotropy hysteresis; rate-dependent stateful response; an annealing schedule (the path, not the endpoint)
Irreversibility (drying, curing) one-way, lossy, non-invertible state transitions (like hashing or quantizing)
Curing / gelation crossing a percolation threshold — a spanning “giant component” appears in a graph
Process → Structure → Property (PSP) a state-space system: input → hidden state → output (\(y=g(x)\), \(x=f(\text{process})\))
Degeneracy (many recipes → one property) a non-injective (lossy) map; hash collisions; an under-determined inverse problem
Viscosity impedance / gain linking stress (effort) to shear rate (flow): \(\tau=\eta\,\dot\gamma\) — Ohm’s law for flow
Shear-thinning a gain that drops as the drive rate rises
Viscoelasticity (\(G'\), \(G''\)) complex impedance \(G^{*}=G'+iG''\) (storage = reactive, loss = resistive); \(\tan\delta\) = loss tangent
Colloidal (DLVO) stability superposed attractive + repulsive potentials → a metastable well + activation barrier (Arrhenius escape)
Zeta potential / Debye length a tunable repulsive “gain” / an RC screening length (add salt → faster screening)
Diffusion; barrier (corrosion) the diffusion / heat equation (a spatial low-pass); an RC / lossy transmission line to mass transport
Scattering vs particle size feature size vs wavelength (antenna / aperture vs \(\lambda\)): Rayleigh \(\sim\lambda^{-4}\), Mie resonance at \(d\approx\lambda\)
Colour / colorimetry projecting an \(\infty\)-dim spectrum onto a 3-vector basis (a lossy 3-channel sensor); metamerism = aliasing
Measurement / characterization indirect, noisy, partial sensing of a hidden state — an observability problem
EIS (corrosion test) literally impedance spectroscopy: \(Z(\omega)\), Bode / Nyquist, equivalent-circuit fit
DLS / laser diffraction an inverse problem: deconvolve a size distribution from a scattering signal
Rheometry system identification of a mechanical network (flow curve = DC gain; oscillatory = Bode plot)
Multi-objective interfering specs constrained multi-objective optimization; a Pareto front (no single optimum)
Sensory vs instrumental target sounds good” vs an audio spectrum: a learned map from features to perceptual labels
Tacit / failure knowledge an expert policy never written down; production tricks not in the repo
Abstract Card / QRA+PSE a typed, schema-enforced record (vs free-text logs) — a documented function with preconditions + tests
Materials informatics (PSP) the ML playbook (featurize → fit → predict → optimize) over a hidden-state pipeline
DoE / mixture design optimal experiment design; sampling a simplex-constrained input space (a probability / softmax vector)
Autonomous / self-driving lab robotics + active learning: a perception → action → learning loop
Forward / inverse surrogate a learned plant model + its inverse (inverse design = a decoder)
Active learning / Bayesian optimization acquisition-function-driven sampling under uncertainty (explore / exploit)
Data-scaling law on dirty data a learning curve \(\text{error}\propto N^{-\alpha}\) — lets you budget data ROI

Intuition. Hold one picture above all others: a paint is a plant with hidden state. You can read the recipe (the input) and you can measure the dried film (the output), but the thing that actually decides performance — the microstructure — lives in between, is never directly observed, and is written by an order-dependent, partly irreversible process. Almost every difficulty in the domain is a corollary of that one diagram.

Reading paths

Pick the route that matches why you opened this reference. Each lands you in the right module fast.

  • Fastest orientation. This Rosetta stone → Microstructure (the hidden state — the crux) → R&D system (R&D as a control / active-learning loop — your home turf) → Case studies (the two worked examples).
  • “Why is composition not enough?” Dirty materials (the objects) → Processing (process and irreversibility) → Microstructure (the latent state performance factors through).
  • “I want the physics with equations.” Physics: rheology = complex impedance, DLVO = a double well, diffusion = the heat equation, scattering = antenna-vs-\(\lambda\).
  • “I care about the data / automation pipeline.” R&D system (loop, DoE, surrogates, active learning, self-driving lab) → Measurement (what the sensors actually report) → Case studies Case A (the scaling-law / data-ROI result).
  • “Connect it back to the category theory.” The bridge, then jump to the named modules of the math reference.

The problem codes (P1–P13)

The P-codes are the synthesis’s first-principles decomposition of what makes industrial-materials R&D fundamentally hard. Each is a structural feature of the problem, not of any one solution.

Code Problem characteristic
P1 Path-dependence / non-commutativity
P2 Composition + irreversibility (→ monoid / category, not group; precisely → premonoidal / effectful category)
P3 Mediation / factorization through hidden intermediate states
P4 Degeneracy (many↔︎one; fibers / quotients / invariants)
P5 Multi-objective interfering constraints (Pareto)
P6 Locality / scope-of-validity + gluing
P7 Transferability / horizontal deployment (functoriality)
P8 Multi-component coupling (tensor / multilinear)
P9 Sensory↔︎physical duality
P10 Knowledge as a structured, composable, verifiable object
P11 Canonical decomposition into irreducible atoms
P12 Quantifiable returns / scaling laws
P13 Invariants & canonical forms

The research directions (RD1–RD8)

The RD-codes are the synthesis’s eight concrete research directions — each a mathematical programme aimed at one or more of the P-codes above.

Code Research direction
RD1 Effectful / premonoidal process-category for dispersed materials with hidden state (performance = functor)
RD2 Trace-monoid recipe algebra + confluent rewriting normal forms
RD3 “Right hidden variable” = universal coimage (sharpening the PSP linkage)
RD4 Transfer-as-functoriality / ologs / functorial data migration for tacit knowledge
RD5 Symmetric-algebra mixture models + \(C(n,k)\) interaction budget
RD6 Sheaves for scope-of-validity + FCA concept lattices
RD7 Degeneracy / robustness via failure-of-UFD / singularity (cf. sloppy models)
RD8 Derived-functor / Ext obstruction theory for irreversibility / emergence

Concept → problem → module

This is the master index. Find the theme you care about, note the P#/RD# it serves, and follow the link to the module that develops it. Several themes span multiple modules — the physics is as interconnected as the algebra.

Theme Where it lives P# / RD#
Composition ≠ performance Dirty materials, Microstructure P1, P3
Process history; mixing order; irreversibility Processing P1, P2, RD1
Microstructure = hidden state Microstructure P3, RD1, RD3
Degeneracy (many recipes → one property) Microstructure, Case studies P4, RD7
Multi-objective / Pareto trade-offs Properties P5
Sensory ↔︎ instrumental Properties, Measurement P9, RD6
Multi-component coupling (non-additive) Properties, Physics P8, RD5
Governing physics & regimes (scope) Physics P3, P6, P8
Measurement = noisy partial sensing; dirty data Measurement P6, P9, P12
Invariants / canonical descriptors Microstructure, Measurement P13
The R&D closed loop; DoE; surrogates; active learning R&D system P5, P12, RD1
Tacit / failure knowledge; Abstract Card; QRA+PSE R&D system P10, RD4
Transfer across equipment / scale / domain Processing, R&D system P7, RD4
Data-scaling law / ROI R&D system, Case studies P12
The bridge to the algebra The bridge all

In the synthesis. This whole reference is the physical grounding for the synthesis’s argument. Its centre of gravity is P3 (performance mediates through a hidden microstructural state) and RD1/RD3 (model that state as a premonoidal process-category whose “right hidden variable” is a universal coimage). Read Microstructure first to see why.

Acronyms

A compact lookup for the abbreviations used across the modules; each is defined again where it is introduced.

Acronym Meaning
PSP Process–Structure–Property (the central linkage)
PSD Particle-size distribution
SSA Specific surface area
PVC / CPVC Pigment volume concentration / its critical value
CMC Critical micelle concentration
DLVO Derjaguin–Landau–Verwey–Overbeek (colloid-stability theory)
\(T_g\) Glass-transition temperature
2K Two-component (reactive) system
VOC Volatile organic compound
LCA Life-cycle assessment
ICME / MGI Integrated Computational Materials Engineering / Materials Genome Initiative
HTE High-throughput experimentation
DoE / RSM Design of experiments / response-surface methodology
QRA+PSE The draft’s knowledge schema (Question / Rationale / Answer / Physics / Scope / Evidence / Counter-example)
CIE / L*a*b* / ΔE Commission Internationale de l’Éclairage / perceptual colour coordinates / perceptual colour difference
GU / DOI / BRDF Gloss units / distinctness of image / bidirectional reflectance distribution function
TSR / NIR / IR / UV Total solar reflectance / near-infrared / infrared / ultraviolet
UV-Vis / FTIR / Raman Spectroscopies
DLS Dynamic light scattering
SEM / TEM / AFM Scanning / transmission electron microscopy; atomic force microscopy
DSC / TGA Differential scanning calorimetry / thermogravimetric analysis
EIS Electrochemical impedance spectroscopy
QUV / ASTM B117 A UV/condensation accelerated-weathering test / standard salt-spray (fog) test
TiO₂ / SiO₂ / CaCO₃ / BaSO₄ Titania / silica / calcium carbonate / barium sulfate (pigments / fillers)
PVDF / DMAc / MEK A fluoropolymer binder / a solvent / methyl ethyl ketone (rub-test solvent)

Key rheology and physics symbols: \(\tau\) shear stress (Pa) · \(\gamma\) shear strain · \(\dot\gamma\) shear rate (s⁻¹) · \(\eta\) viscosity (Pa·s) · \(\tau_y\) yield stress · \(G'\) storage modulus · \(G''\) loss modulus · \(G^{*}\) complex modulus · \(\tan\delta\) loss tangent · \(\theta\) contact angle · \(\gamma\) (interfaces) surface / interfacial energy (J/m² ≡ N/m) · \(\zeta\) zeta potential (mV) · \(\kappa^{-1}\) Debye length · \(D\) diffusion coefficient (m²/s) · \(n\) refractive index · \(\lambda\) wavelength · \(\alpha\) absorption coefficient (and, separately, a power-law exponent).

Units & scales

A quick orientation to the numbers you will meet. Most quantities here are scale-dependent, so a value without its scale (especially viscosity without a shear rate) is meaningless.

Quantity Scale / typical values
Length 1 nm \(=10^{-9}\) m (molecules, nanostructure); 1 µm \(=10^{-6}\) m (pigment particles, microstructure; visible \(\lambda \approx 0.4\)\(0.7\) µm); a coating film \(\approx 10\)–100 µm \(= 0.01\)–0.1 mm. Colloidal \(\approx 1\) nm–1 µm.
Viscosity Pa·s (water \(\approx 0.001\) Pa·s \(= 1\) mPa·s; paints span a wide range and are shear-dependent — always report the shear rate).
Moduli / stress Pa (\(G'\), \(G''\), yield stress, mechanical strength).
Surface / interfacial energy mJ/m² ≡ mN/m (water surface tension \(\approx 72\) mN/m).
Zeta potential mV (\(\lvert\zeta\rvert \gtrsim 25\)–30 mV ⇒ usually stable).
Colour \(L^{*}\) runs 0–100; \(\Delta E \approx 1\) ≈ just-noticeable difference.
Gloss / sheet resistance GU at a stated angle (20° / 60° / 85°); Ω/sq.
IR wavenumber cm⁻¹ (mid-IR \(\approx 4000\)–400 cm⁻¹ ≈ 2.5–25 µm; the thermal / atmospheric window ≈ 8–13 µm).
Scaling-law shape \(\text{error} \approx a\,N^{-\alpha}\) — a straight line on a log-log plot of error vs dataset size \(N\).

Recap

  • One mental model. A paint is a plant with hidden state: recipe in, dried film out, an unobserved microstructure in between, written by an order-dependent and partly irreversible process. Everything else follows.
  • The Rosetta stone is the map. Microstructure = latent state, processing = a stateful pipeline, viscosity = impedance, colour = a lossy 3-channel sensor, characterization = an observability problem.
  • Navigate, don’t read linearly. Use the reading paths for a route and the concept index to jump straight to a module; trust the P#/RD# chips to connect each phenomenon to a real R&D problem.
  • The crux is P3. Composition does not determine performance because it factors through a hidden state — start at Microstructure.
  • Two halves, one effort. This is the physical what; the math reference is the formal how; the synthesis argues from the hardness to the algebra.

Part of a four-document set: the ARiSE draft (problem + AI solution), this modular Materials-science reference, the companion math reference, and the synthesis. Generated from modular Markdown with a custom static-site builder.

Mathematics is typeset with MathJax (loaded once from a CDN with Subresource Integrity; needs network on first view). Diagrams are inline SVG and follow the light/dark theme. Keyboard: / search · [ ] prev/next · t theme.