NEWS

factorana 1.7.1 (2026-06-10)

Bug fixes

robust_se() now returns a named vector, as documented. It previously computed sqrt(pmax(0, diag(V))), and pmax() copies attributes from its first argument (the scalar 0), which dropped the parameter names; name-based indexing of the result then silently returned NA. Fixed by ordering the parameter vector first (sqrt(pmax(diag(V), 0))), with a regression test on the naming contract.

Usability

define_model_component(): covariates now defaults to NULL, so a pure measurement item no longer needs covariates = NULL spelled out. The documentation also makes the factor-assignment convention explicit (the length-n_factors loading_normalization vector: 0 off-factor, 1 anchor, NA free), and adds a factor_index convenience argument for assigning an item to a single factor (factor_index = 2, loading_normalization = 1 is shorthand for c(0, 1, 0) in a 3-factor model).
vcov_factorana() now verifies that the data it is given matches the data the model was fit on (same rows, same order) via a lightweight fingerprint stored on the fit, and errors with a clear message otherwise. This catches the "right shape, wrong rows" case that the free-parameter-count check could not.

factorana 1.7.0

New features

simulate_factor_model() generates data from a fitted or fully specified factorana model, following the legacy simulation algorithm: for each simulated unit a base row is drawn from data (with optional weights) to supply the covariates, the latent factor(s) are drawn from the model's unconditional distribution, the latent type is drawn from the type model, and each component's outcome is drawn from its model given the covariates and factors. Output is "full detail" like the legacy simulation_data.csv: the resampled covariates, the drawn factors (and type / mixture component when present), and per component the linear predictor, the shock, the choice probabilities, and the simulated outcome. The outcome column is named after the component's outcome variable so the simulated data is directly re-estimable with the same model system.

Supported: all four model types (linear, probit, binary and multinomial logit, ordered probit); latent types (n_types > 1); mixtures of normals (n_mixtures > 1); and the correlated and structural-equation (SE_linear/SE_quadratic/SE_interactions/SE_full) factor structures. Validated by parameter recovery (simulate from known parameters, re-estimate, recover) for every model type and factor structure. Endogenous regressors and the goodness-of-fit mode are not yet implemented.

factorana 1.6.0

New features

Bootstrap inference, designed for the "estimate one stage at a time, across many bootstrap samples, on a compute cluster" workflow (matching the legacy backend). Resampling uses integer frequency weights (cluster or row multiplicities) rather than physically duplicating rows: estimating with these weights is numerically identical to estimating on the expanded data (verified) and reuses the existing per-observation weight machinery, so a cluster drawn k times simply gets weight k with no duplicate-id bookkeeping.

Four building blocks, all restartable and distributable:
- generate_bootstrap_samples() draws and persists the resampling weights once (reproducible across nodes and restarts).
- bootstrap_fit_sample() estimates one stage for one sample and writes stage_dir/sample_<id>.rds; it skips if that file exists, so each (stage, sample) is an idempotent unit of work to scatter across a cluster and resume after interruptions. Zero-weight rows are dropped and the rest carry their integer frequency weight; set previous_stage for later stages to chain each replicate on its own earlier-stage fit.
- collect_bootstrap() reads the finished samples and returns the bootstrap covariance, bootstrap standard errors, and percentile confidence intervals.
- bootstrap_factorana() is a single-node convenience driver over the above; bootstrap_factorana_multistage() drives a chained multi-stage bootstrap in one call (looping samples x stages, chaining each replicate on its own earlier stage, skipping later stages when an earlier one fails to converge).
For panel / two-stage models the bootstrap is the honest route to standard errors that account for first-stage (measurement) uncertainty, which the sandwich estimator omits.

factorana 1.5.0

New features

Robust and cluster-robust standard errors via a new post-hoc vcov_factorana(object, data, type = c("hessian", "robust", "cluster"), cluster = ...), with a robust_se() convenience wrapper. The estimator is the sandwich V = H^{-1} B H^{-1}: the bread H^{-1} is the inverse observed information already computed at fit time, and the meat B is built from the per-observation scores of the marginal (factor-integrated) log-likelihood (B = sum_i g_i g_i' for robust, B = sum_c s_c s_c' for cluster). A new C++ routine (evaluate_obs_scores_cpp) returns the per-observation score matrix; it reuses the score already formed inside the likelihood and is only computed on request, so estimation speed is unchanged. Validated against sandwich::vcovHC and sandwich::vcovCL on a GLM-equivalent model to within machine precision.

Notes: one factorana row is one independent unit whose measurements are integrated jointly over the latent factors, so clustering is meaningful only when a cluster groups several rows (households, schools, or stacked person-wave rows in a long-format second stage); with one row per cluster the cluster estimator reduces to the robust one. For two-stage fits these standard errors treat the first-stage parameters and plug-in factor scores as known and so understate uncertainty; a both-stages bootstrap (planned) is the honest route there.

factorana 1.4.1

Bug fixes

Ordered probit now rejects mis-coded outcomes at component construction instead of silently corrupting the likelihood at estimation. The C++ ordered-probit evaluator uses the outcome value directly as the category index (1..num_choices) and indexes the threshold parameters by it, but the component does not store its data and the estimation pipeline does not recode the outcome. A value above num_choices therefore read threshold parameters out of bounds (into the next component's parameters), and a 0-indexed value was off by one, in both cases producing a wrong log-likelihood with no error. define_model_component() now requires an exact contiguous 1..num_choices coding on the evaluation subset and stops with an actionable message (naming the offending value or empty category) otherwise. The previous category-count check only caught a subset of these cases.
Multinomial logit: fixed a spurious "longer object length is not a multiple of shorter object length" warning in the contiguous-outcome check, which recycled vectors of different lengths. The check now compares lengths first. Standard multinomial logit already rejected non-contiguous, 0-indexed, and empty-category codings; regression tests now lock that behavior in alongside the new ordered-probit guard.

Note: native support for estimating ordered-probit or multinomial-logit models with one or more empty categories (warn-and-collapse rather than error) is not yet implemented; it requires recoding the outcome to a contiguous set in the estimation pipeline, since the component does not retain its data.

factorana 1.4.0

New features

define_factor_model() gains two structural-equation factor structures that add cross-product (interaction) terms between input factors, giving parity with the factor_spec = "interactions" / "full" options already available for observed components:
- factor_structure = "SE_interactions": linear plus cross-product terms, f_out = a + sum_j a_j f_j + sum_{a<b} a_ab f_a f_b + epsilon.
- factor_structure = "SE_full": linear, quadratic, and cross-product terms together.
The cross-product coefficients are named se_interaction_<a>_<b> for input factors a < b (one per distinct pair, i.e. choose(n_input, 2) of them). The analytical gradient and Hessian integrate the cross-products through the same Gauss-Hermite quadrature already used for the linear and quadratic terms, preserving full-information consistency. fix_factor_param(), equality constraints, se_covariates, latent types, and two-stage previous_stage estimation all work with the new structures.

Cross-product terms require at least two input factors (n_factors >= 3). With a single input factor there are no distinct pairs to interact, so "SE_interactions" silently downgrades to "SE_linear" and "SE_full" to "SE_quadratic" with a one-line message, mirroring the factor_spec downgrade behavior of define_model_component().

factorana 1.3.4 (2026-05-12)

Test infrastructure

tests/testthat/test-interaction-factors.R: the two FD-vs-analytical numerical checks for the 3-factor linear model with full second-order terms (Tests 21 and 22) now call skip_on_cran(), matching the rest of the file (every other test there already skips on CRAN). The analytical gradient and Hessian are unchanged; the tests still run locally and on win-builder, so developer correctness coverage is preserved.

Reason: under alternative BLAS implementations (ATLAS in particular) the finite-difference reference accumulates floating-point round-off differently, which on one element of the analytical Hessian pushed the max relative error from roughly 0.98e-3 to 1.02e-3 against a 1e-3 tolerance. The analytical Hessian itself is correct; the discrepancy is in the FD reference, not in factorana. This change prevents the spurious failure flagged on the CRAN ATLAS check farm.

factorana 1.3.3

Bug fixes

The 1.3.2 warn-and-skip for stale previous_stage / init_params names was incomplete: setup_parameter_constraints() emitted the warning and skipped the stale slots in its per-parameter branch logic, but the unfiltered full_init_params vector was still passed to the C++ estimator, which errored with "Fixed values size mismatch" because SetParameterConstraintsWithValues enforces init_values.size() == nparam.

estimate_model_rcpp() now reconciles full_init_params with the current model's canonical parameter layout before any C++ call: stale names (present in the anchor but not in the current model) are dropped with a single warning that lists the first 10 offending names, and missing names (present in the current model but not in the anchor) are filled from a fresh initialize_parameters() pass. Works for both initializer-built and user-supplied init_params.

Triggering scenario: anchor a Stage 2 model on a Stage 1 result, and Stage 2 has fewer components than Stage 1 (e.g., because 1.3.1's warn-and-skip on define_model_component() dropped a component whose evaluation_indicator is all zero for the current wave). Reported by the MH-trap pipeline at the wave-to-wave handoff.

factorana 1.3.2

Changes in behavior

setup_parameter_constraints() now warns once and skips previous_stage / init_params parameter names that are not in the current model, instead of detonating inside the per-parameter branch logic. This mirrors the warn-and-skip applied to define_model_component() in 1.3.1: panel-data pipelines that build one anchor and reuse it across waves with slightly different component lists no longer crash on the names the current model does not recognize. The warning lists the first 10 stale names and a total count.

Migration: callers that rely on the prior error to detect typos in previous_stage anchors should grep for the new "Skipping N previous_stage / init_params name(s)" warning.

factorana 1.3.1

Changes in behavior

define_model_component() previously errored with "Evaluation subset has zero rows" when its evaluation_indicator selected no rows. This was a pipeline-halting failure mode for panel data with wave- or cohort-specific item availability, where a given component contributes nothing in some waves but should simply be dropped. The function now emits a warning and returns NULL in that case. define_model_system() filters NULL entries out of the components list automatically; an end-to-end pipeline that builds many components and lets some come back empty no longer needs to branch around it. The hard error remains for the truly-empty-input case (data already has zero rows before the eval filter).

fix_coefficient(NULL, ...) returns NULL silently so chained component-mutation calls propagate the skip without crashing.

Migration: callers that catch the old error message should switch to checking is.null() on the component or to letting define_model_system() filter NULLs.

factorana 1.3.0

New features

fix_factor_param(factor_model, name, value) constrains a parameter in the latent-factor distribution (factor variances, SE-equation slopes / intercept / type-specific intercepts / residual variance, type-probability intercepts and loadings, factor-mean covariate coefficients, SE covariate coefficients) to a fixed value at model- definition time. The fixed parameter appears in result$estimates at the user-supplied value with result$std_errors == 0 and is excluded from the optimizer's free-parameter vector.

The most common use case is setting a type_<t>_loading_<k> to 0 when factor k is known a priori not to enter the type-probability model (e.g., regime-switching models where one latent dimension drives the regime probability and others do not). Estimating such a loading free leaks identification, walks the optimizer along a flat ridge, and inflates the standard errors of the remaining type-model parameters.

API: single-fix and named-vector batch forms, fix_factor_param(fm, "type_2_loading_2", 0) or fix_factor_param(fm, c(type_2_loading_2 = 0, type_2_loading_3 = 0)). Pass value = NA_real_ to release a previously fixed parameter. Idempotent at 0 for the SE-outcome-factor type loading (auto-fixed there); errors on any non-zero value for that slot. Overrides the auto-fix of non-identified factor variances.

When combined with define_model_system(previous_stage = ..., free_params = ...): fix_factor_param() always wins. A name listed in both free_params and fix_factor_param() stays fixed, with a one-time warning per conflict to surface accidental misuse. A value in previous_stage$estimates for the same name is overridden, with an informational warning.

print(factor_model) echoes the fixed factor-distribution parameters.

Bug fixes

Bundles the parameter-mapping bug fixes that were committed but never shipped to CRAN (1.2.1 + 1.2.2 + 1.2.3 in the local history). All three addressed silent-wrong-estimate paths when factor_structure = "SE_linear" or "SE_quadratic" was combined with n_types > 1 and either se_covariates or factor_covariates:

R/C++ parameter-ordering desync. The R initializer (R/initialize_parameters.R, R/optimize_model.R::build_parameter_metadata) and the C++ FactorModel constructor disagreed on whether type-model parameters (typeprob_*_intercept, type_*_loading_*) come before or after factor_mean_* and se_cov_*. The R side now matches the constructor: factor_var → SE → typeprob/type_loading → factor_mean → se_cov.
src/rcpp_interface.cpp::initialize_factor_model_cpp: param_offset and type_param_start recomputed in the constructor-true order so the auto-fix logic for the outcome-factor type loading writes to the right slot.
src/FactorModel.cpp::SetFactorMeanCovariates: factor_mean_param_start is now nparam at the call site (the actual append point), not a precomputed offset that ignored typeprob / mixture slots.
src/rcpp_interface.cpp::initialize_factor_model_cpp un-fix loop: the name-to-index map for define_model_system(previous_stage = ..., free_params = ...) recognized only factor_var_*, mix_*, se_intercept, se_linear_*, se_quadratic_*, se_intercept_type_*, and se_residual_var. Names like typeprob_*_intercept, type_*_loading_*, factor_mean_*_*, and se_cov_* silently failed the lookup and stayed fixed in C++ while R's setup_parameter_constraints correctly marked them free. The optimizer ran on a smaller free-set than R expected, and R's estimates[free_idx] <- estimates_free recycled, producing a clean k-cycle of repeated values across covariates and type params. Reported as the MH-trap regime model "7-cycle" (17 free factor-distribution slots collapsing to 7 unique values).
src/rcpp_interface.cpp equality_constraints map had the same coverage gap and a typeprob_<t>_intercept -> type_<t>_intercept rename typo. Beyond making equality constraints on those names unrecognised, the missing factor_mean and se_cov slots also caused the iterator to under-count factor-level positions, so equality constraints on component-level loadings, sigmas, or thresholds were silently bound to factor-distribution slots whenever factor_covariates or se_covariates were used.

Regression tests at: tests/testthat/test-two-stage-se-types.R::"Constrained re-run via free_params: SE-cov / typeprob / type_loading slots get distinct values", tests/testthat/test-equality-constraints.R::"equality_constraints + se_covariates does not corrupt component idx mapping", and tests/testthat/test-fix-factor-param.R::* for the new feature.

factorana 1.2.3

Combined release covering the parameter-mapping bugs introduced by the SE_linear / SE_quadratic + n_types > 1 + se_covariates / factor_covariates combinations. Bundles the 1.2.1 fixes (R/C++ parameter-ordering desync, factor_mean_param_start collision, and the un-fix loop name-coverage gap that produced a clean k-cycle of recycled values when callers used define_model_system(previous_stage = ..., free_params = ...) against a matching SE structure) with the 1.2.2 fix (equality_constraints map name-coverage gap, which silently bound component-level constraints to factor-distribution slots when factor_covariates or se_covariates were also present). See the 1.2.1 and 1.2.2 sections below for full descriptions and regression-test pointers.

factorana 1.2.2

Bug fixes

src/rcpp_interface.cpp::initialize_factor_model_cpp parameter-name to index map for the equality_constraints lookup had the same name-coverage gap as the un-fix loop fixed in 1.2.1. The factor-level portion of the map enumerated only factor_var_*, mix_*, se_intercept, se_linear_*, se_quadratic_*, se_intercept_type_*, and se_residual_var. It also contained type_<t>_intercept (which is a typo: the actual parameter is named typeprob_<t>_intercept), and was missing every type_<t>_loading_<k>, factor_mean_<k>_<cov>, and se_cov_<cov> entry. Beyond making equality constraints on those names unrecognised, the missing factor-mean and SE-covariate slots caused the iterator to under- count factor-level positions, so every component-level index added to the map after the factor block was off by (n_factor_mean + n_se_cov). As a result, equality constraints on component-level loadings, sigmas, or thresholds silently bound to factor-distribution slots whenever factor_covariates or se_covariates were used.

The map is now built in the same constructor-true layout used by the un-fix loop and the FactorModel constructor: factor_var* -> [factor_corr_*] -> mix_means / mix_logweight -> se_* -> typeprob_*_intercept / type_*_loading_* -> factor_mean_<k>_<cov> -> se_cov_<cov> -> component model params. A regression test in tests/testthat/test-equality-constraints.R constructs an SE_linear + se_covariates = c("X") model with equality_constraints tying cross-wave loadings and sigmas, and asserts that tied parameters share values to 1e-10 (binding the correct measurement slots) and that se_cov_X recovers its DGP value (slot is not shifted).

factorana 1.2.1

Bug fixes

factor_structure = "SE_linear" (or "SE_quadratic") combined with n_types > 1 and either se_covariates, factor_covariates, or both, silently scrambled gradients, Hessians, and final parameter estimates due to a parameter-layout disagreement between the R initializer and the C++ FactorModel constructor. The C++ constructor laid out parameters as [factor_var, se_*, typeprob/type_loading, factor_mean_*, se_cov_*, model] while the R initializer (R/initialize_parameters.R, R/optimize_model.R::build_parameter_metadata) laid them out as [factor_var, se_*, factor_mean_*, se_cov_*, typeprob/type_loading, model]. Every gradient and Hessian element involving a covariate slot or a type-model slot was permuted, and the final result$estimates named vector held values belonging to the wrong slots. The R initializer and build_parameter_metadata now place type-model parameters immediately after the SE block, before factor_mean_* and se_cov_*, matching the C++ constructor.
src/rcpp_interface.cpp::initialize_factor_model_cpp computed param_offset and type_param_start in the same incorrect order (covariates before type params) used by the R initializer. The param_fixed_vec and the auto-fix logic for the outcome-factor type loading were therefore writing to the wrong indices, marking se_cov_* slots as fixed instead of type_*_loading_<n_factors>. The offsets are now computed in the same constructor-true order so the fixed-vs-free bookkeeping aligns with the actual parameter positions.
src/FactorModel.cpp::SetFactorMeanCovariates set factor_mean_param_start = n_input_factors + nse_param (or nfac for non-SE structures), which assumed no type-model or mixture parameters preceded the factor-mean block. Whenever n_types > 1 or n_mixtures > 1 was used together with factor_covariates, the factor-mean block start collided with the typeprob / mixture-mean block. The start index is now nparam at the time of the call, which is the actual append point.
src/rcpp_interface.cpp::initialize_factor_model_cpp un-fix loop for define_model_system(previous_stage = ..., free_params = ...) in the matching-structure path used a fac_name_idx map that recognized only factor_var_*, mix_*, se_intercept, se_linear_*, se_quadratic_*, se_intercept_type_*, and se_residual_var. Names like typeprob_*_intercept, type_*_loading_*, factor_mean_*_*, and se_cov_* silently failed the lookup and remained fixed on the C++ side while the R-side setup_parameter_constraints correctly marked them free. The C++ optimizer then ran on a smaller free-set than R expected, and R's estimates[free_idx] <- estimates_free triggered vector-recycling, producing a perfect k-cycle of repeated values across the longer-than-actual R free index. Reported by the MH-trap regime model (17 free factor-distribution slots collapsed to 7 unique values). The name-to-index map now covers every factor-distribution parameter type in the constructor-true layout. A regression test guards the matching-structure / free_params path with se_covariates plus n_types > 1.

factorana 1.2.0 (2026-04-22)

Bug fixes

Stage-2 SE workflow (define_model_system(..., previous_stage = ...) with factor_structure = "SE_linear" or "SE_quadratic") was silently fixing typeprob_*_intercept and type_*_loading_* parameters when the previous stage already had n_types > 1. The factor_dist_patterns regex list in define_model_system() enumerated only factor_var_*, se_*, chol_*, and factor_mean_* as factor-distribution parameters, so typeprob and type-loading slots fell through into measurement_params and were forced to their Stage 1 values. The C++ initialization correctly left those parameters free. The disagreement scrambled the mapping between the C++ free-parameter vector and the R-side free_idx: evaluate_likelihood_rcpp() extracted 7 free gradient entries from C++ and scattered them into a 5-slot R map, shifting every value past position 5. The result was a Hessian whose SE x SE sub-block looked permuted (max rel_err ~1.67, previously flagged as the "TEST 4 known issue"). Added ^typeprob_ and ^type_[0-9]+_loading_ to the pattern list; now the two sides agree that those are factor-level free parameters. Also matches TEST 1's explicit expectation that typeprob and input-factor type loadings stay free in Stage 2.
Type-probability mixture contribution to the Hessian on the d^2 L_mix / d(sigma^2_k)^2 diagonal was missing two terms in FactorModel::CalcLkhd (section 3b):
1. The cross term 2 * dpi_t/d(sigma^2_k) * dL_t/d(sigma^2_k) was added only once rather than twice. The off-diagonal code correctly added both orderings of the cross term; the k == l branch kept a "we only add once due to symmetry in sum" comment that was wrong. On the diagonal the two orderings coincide so the correct behavior is a factor of 2.
2. The chain-rule second derivative of the type probability through the GH scaling of the input factor itself: when f_k = sigma_k * x_q, the full diagonal second derivative of pi_t w.r.t. sigma^2_k is d^2(pi)/d(f_k)^2 * (df_k/d(sigma^2_k))^2 + d(pi)/d(f_k) * d^2(f_k)/d(sigma^2_k)^2. Only the first term was being accumulated; the d(pi)/d(f_k) * d^2(f_k)/d(sigma^2_k)^2 term is now added when k == l. In combination with the free/fixed mapping fix, the Stage-1 with-types to Stage-2 SE_linear Hessian FD max rel_err dropped from 1.67 to ~8e-6 at n_quad = 12, reaching machine precision at n_quad >= 24. The previously skipped TEST 4 in test-two-stage-se-types.R has been re-enabled as a regression guard.

factorana 1.1.8

Minor changes

define_model_component(): the no-intercept sanity warning now fires only when the caller has NOT explicitly opted out of an intercept via intercept = FALSE. The explicit opt-out is a deliberate choice (common in validation and shape-only tests and in ordered-probit setups where the intercept is absorbed into the cutpoints); treating it as a candidate for the misspecification warning produced noise without signal. Accidental omissions (intercept left at its default of TRUE, but no intercept covariate provided) still trigger the warning.
Attempted re-enable of the Stage-1-with-types / Stage-2-SE_linear Hessian FD placeholder (test-two-stage-se-types.R TEST 4): still fails after the v1.1.7 Hessian accumulation fix (max err ~1.7 on the SE x SE sub-block, gradient passes). The remaining mismatch is a separate issue in how the type-probability model interacts with SE_linear under previous_stage, not covered by the general accumulation fix. Re-skipped with an updated comment.

factorana 1.1.7

Bug fixes

Analytical Hessian now accumulates correctly under equality constraints. Previously FactorModel::CalcLkhd iterated over freeparlist for its Hessian contribution loops, which skipped equality-tied (derived) parameters. The subsequent ExtractFreeHessian aggregation had no tied-position values to sum into the primaries, so the analytical Hessian at primary x primary positions missed the d^2L / d(primary) d(derived), d^2L / d(derived) d(primary), and d^2L / d(derived) d(derived) terms. This produced analytical zeros at positions where finite differences (with equality enforced at reinitialisation) reported magnitudes of 10 to 50, meaning SE estimates under equality constraints were biased. Fix: iterate over gradparlist (free plus tied) in the Hessian loops and symmetrise full_hessL before ExtractFreeHessian. Validated by the now-passing Stage 1 FD tests in test-dynamic-single-factor.R (linear max_err ~2.5e-6, oprobit Hessian max_err ~3.3e-6).
The C++ name-to-index map for ordered-probit thresholds was looking up the field n_categories instead of the actual component field num_choices, so threshold equality constraints were silently dropped. Fixed in rcpp_interface.cpp. This was the root cause of the conv = 1 + "items to replace" warnings observed in the Mental Health Trap simulation's Stage 1 oprobit estimation.

New features

define_dynamic_measurement() for model_type = "oprobit" now ties only the threshold INCREMENTS (_thresh_k for k = 2..K-1) across periods and leaves _thresh_1 period-specific, mirroring the linear strategy (tie sigmas, free intercepts). Patch contributed by an external agent on the MH Trap project.
define_dynamic_measurement() now silently strips "intercept" / "constant" from covariates when model_type = "oprobit" (ordered probit absorbs the intercept into the cutpoints). The default covariates = "intercept" now works for every supported model type.

New tests

test-se-models.R: .build_se_type_model gains an indicator_type argument ("linear" or "oprobit") and two new tests exercise single-stage SE_linear + n_types = 2 FD (gradient and Hessian) with ordered-probit indicators.
test-two-stage-se-types.R: TEST 3 adds an oprobit variant of the two-stage FD test at Stage 2 (analog of the linear TEST 2). The old TEST 3 known-issue placeholder is renumbered to TEST 4.
test-dynamic-single-factor.R:
- TEST 4: structural test for the oprobit wrapper path (equality constraints, no estimation).
- TEST 5: oprobit Stage 1 converges cleanly with tied thresholds (regression guard for the threshold-name-lookup bug).
- TEST 6-7: Stage 1 FD (gradient + Hessian) for the dynamic-measurement wrapper at a non-MLE parameter point, linear and oprobit. These tests actively exercise the Hessian-accumulation fix.
- TEST 8: oprobit two-stage structural parameter recovery. Became feasible once the two C++ bugs were fixed; recovery of factor_var_1, se_intercept, se_linear_1, se_residual_var is within tolerance at n = 2500.

factorana 1.1.6

Bug fixes

define_dynamic_measurement() with model_type = "oprobit" now ties only the threshold INCREMENTS across periods (_thresh_k for k = 2..K-1) and leaves _thresh_1 period-specific. Previously all thresholds were tied, which forced any wave-to-wave shift in the latent factor mean into the factor variances and produced Stage 1 convergence code 1 (false convergence) plus boundary warnings. The new behaviour mirrors the linear case: tie the scale (sigmas / threshold increments), leave the location (intercepts / thresh_1) period-specific. Patch contributed by an external agent working on the Mental Health Trap simulation, where Stage 1 convergence went from conv = 1 with factor_var_1 = 4.06 / factor_var_2 = 0.98 to conv = 0 with balanced variances.
define_dynamic_measurement() now silently strips "intercept" or "constant" from covariates when model_type = "oprobit". Ordered probit absorbs the intercept into the cutpoints, so factorana rejects an intercept covariate on oprobit components. The wrapper's default covariates = "intercept" now works for every supported model type without requiring the user to tailor it by model type.

Tests

test-dynamic-single-factor.R gains a structural test for the oprobit wrapper path that verifies: the intercept covariate is stripped, equality constraints contain only threshold increments k = 2..K-1 (not thresh_1), thresh_1 appears as a free parameter in every period, and build_dynamic_previous_stage() produces a dummy with no _intercept parameters and with the anchor-period thresh_1 carried into every period's slot. Estimation-level recovery for oprobit is not asserted: the oprobit dynamic model is empirically fragile at moderate n and identification is tracked separately.

factorana 1.1.5

Bug fix

build_dynamic_previous_stage() now handles oprobit, probit, and logit model types. Those types do not have explicit _intercept parameters (location is absorbed into cutpoints or the link function); the previous version would error when looking up a missing intercept. Linear behaviour is unchanged.

Tests

test-dynamic-single-factor.R gains a third test that exercises the wrapper plus Stage 2 SE_linear with n_types = 2. Types shift the period-2 factor mean via se_intercept_type_2. Recovery of the well-identified structural parameters (factor_var_1, se_linear_1, se_intercept_type_2, se_residual_var) is verified on a simulated DGP, with a more generous tolerance on se_intercept (which trades off against typeprob_2_intercept and type_2_loading_1 when measurement information density is low).

factorana 1.1.4

New features

New helper functions define_dynamic_measurement() and build_dynamic_previous_stage() encapsulate the standard workflow for estimating an SE_linear or SE_quadratic structural model on a single latent construct observed at two or more time points:
- define_dynamic_measurement() builds the Stage 1 measurement model (a k-factor independent system with loadings and residual sigmas tied across periods via equality_constraints, measurement intercepts left period-specific).
- build_dynamic_previous_stage() constructs a Stage 2 previous_stage object that plugs the anchor-period (wave 1 by default) intercepts into every factor slot. This anchors the measurement level under the factor-identification convention E[f_k] = 0 and lets the observed period-to-period mean shift in Y identify the structural intercept (alpha) in Stage 2.
- Supports model_type of "linear", "oprobit", "probit", and "logit" with appropriate tying of thresholds (oprobit) or sigmas (linear).
Refactored tests/testthat/test-dynamic-single-factor.R to use the wrapper; recovery results are unchanged.
Refactored vignettes/dynamic_structural.Rmd to use the wrapper.

factorana 1.1.3

New tests and documentation

New tests/testthat/test-dynamic-single-factor.R covers the standard workflow for estimating an SE_linear dynamic structural equation on a single latent construct measured at two time points:
- Stage 1 fits a 2-factor independent measurement model with equality_constraints tying factor loadings and residual sigmas across periods but leaving measurement intercepts period-specific.
- A dummy 2-factor previous_stage object is built that carries the wave-1 intercepts into both factor slots (discarding the wave-2 intercepts, which absorb the structural-equation mean shift).
- Stage 2 fits SE_linear and recovers the structural intercept se_intercept (alpha), slope se_linear_1 (beta), residual variance se_residual_var, and input factor variance factor_var_1.
New vignette vignettes/dynamic_structural.Rmd walks through the same workflow with executable code and explains the motivation for using wave-1 intercepts in Stage 2.
Removed the parameter-recovery test from test-two-stage-se-types.R (its Stage-1-no-types -> Stage-2-with-types setup is not a canonical workflow); the shape test, FD gradient/Hessian test, and the skipped Stage-1-with-types known-issue placeholder remain in place.

factorana 1.1.2

Bug fixes

Two-stage estimation with previous_stage + SE_linear/SE_quadratic and n_types > 1 now correctly builds the Stage 2 parameter vector. Prior versions omitted the typeprob_* and type_*_loading_* slots, causing either a crash in setup_parameter_constraints() or silently mis-fixed parameters. The measurement-parameter filter in initialize_parameters() was also tightened so that factor-level type parameters from Stage 1 are no longer duplicated into the measurement block. Discovered during analysis of a structural model where types are introduced at Stage 2.

Tests

New tests/testthat/test-two-stage-se-types.R adds:
- a shape test that verifies Stage 2 SE_linear + n_types = 2 produces the expected parameter vector aligned with build_parameter_metadata(),
- a finite-difference gradient and Hessian check at the DGP parameters,
- a structural parameter recovery test (se_linear_1, se_intercept_type_2, se_residual_var, factor_var_1) with init se_intercept = -0.5 (Stage 1 absorbs E[f2] into the measurement intercepts, so the MLE se_intercept is negative even if the DGP constant is 0), and
- a skipped placeholder documenting a known Hessian-FD mismatch in the Stage-1-with-types -> Stage-2-SE_linear variant (not the common workflow; tracked for a future fix).

factorana 1.1.1

CRAN resubmission fixes

Added method references (with DOIs) to the DESCRIPTION field: Heckman, Humphries & Veramendi (2016, 2018) and Humphries, Joensen & Veramendi (2024).
Added \value (via @return) to all exported functions that were missing it, including as_kv, estimate_and_write, write_model_config_csv, the adaptive-quadrature and observation-weight setters, and every print method.
Replaced \dontrun{} with runnable examples (fix_coefficient, fix_type_intercepts) or \donttest{} blocks (results_table, results_to_latex, components_table, estimate_factorscores_rcpp, cleanup_parallel_workers).
estimate_and_write() and write_model_config_csv() no longer write to a default path; results_dir / file is now required (use tempdir() in examples and tests).
Replaced the non-executing introduction vignette with two executable vignettes: measurement_system (two-factor CFA) and roy_model (sector choice with a latent ability factor).

Bug fixes

Fixed systematic-suite test that constructed an ordered-probit component with intercept in its covariates; this configuration is rejected (the intercept is absorbed into the cut points).

factorana 1.0.2

Improvements

CRAN compliance fixes
Added introductory vignette
Improved test coverage for SE models, equality constraints, observation weights
Documentation improvements: fixed adaptive integration formula in README
Updated SE_linear example to use larger sample size for reliable convergence

factorana 1.0.1

Bug Fixes

Fix binary logit initialization and add dedicated test

factorana 1.0.0

New Features

Structural equation models (SE_linear, SE_quadratic) for causal factor relationships
Mixture of normals factor distribution (n_mixtures = 1, 2, or 3)
Equality constraints for measurement invariance via equality_constraints parameter
Component-level type control via use_types parameter
Observation weights for survey weights or importance sampling
Checkpointing for long-running estimations via checkpoint_file parameter
Exploded multinomial logit for ranked choice models
Exploded nested logit with exclude_chosen = FALSE
Rank-share corrections via rankshare_var parameter

Core Features

Multi-factor models with flexible loading normalization
Linear, probit, ordered probit, and multinomial logit model components
Analytical gradients and Hessians for fast convergence
Parallel estimation via doParallel for large datasets
Multi-stage (sequential) estimation with fixed early-stage parameters
Adaptive integration for efficient two-stage estimation
Factor interaction terms (quadratic and cross-product) via factor_spec
Correlated two-factor models via factor_structure = "correlation"