Publications | Vijay Prasad Javvadi

PAPER A Process-Metric-Based Software Defect Prediction at Scale: An Empirical Evaluation of Six Machine Learning Classifiers Across 296,457 File Instances from Five Open-Source Systems

Status: Under Review Venue: IEEE Access (Q2, IF 3.9) Manuscript ID: Access-2026-25027 Submitted: 8 Jun 2026 (resubmission) Expected decision: Aug–Nov 2026

Abstract. Uniform empirical comparison of six classifiers (LR, DT, RF, GB, XGB, MLP) on a 296,457-instance dataset assembled from five mature open-source systems (Elasticsearch, Spring Boot, Hadoop, Kafka, Express). Stratified 5-fold cross-validation with paired-t and Wilcoxon significance testing. Random Forest is best (AUC 0.8998, F1 binary 0.6355, F1 macro 0.7595); XGBoost is the closest competitor (AUC 0.8955).

Contributions. Multi-model Gini feature importance with explicit caveat on Strobl 2007 high-cardinality bias; documentation of the deployed model gb-paper1-v4-fixed_age; honest threats-to-validity disclosure including the file_age_days sign bug that contaminated 40% of rows in the prior revision and is now eliminated.

Dataset DOI → GitHub repository →

PAPER B Cross-Repository Software Defect Prediction: A Leave-One-Repository-Out Empirical Study Across Five Mature Open-Source Projects

Status: Under Review Venue: Wiley JSEP — J. Software: Evolution and Process (Q2, IF 1.8) Manuscript ID: 3489875 Submitted: 12 Jun 2026 Expected decision: Aug–Sep 2026

Abstract. Systematic leave-one-repository-out validation reveals an AUC–F1 asymmetry under cross-repository conditions and identifies defect-rate mismatch as the primary qualitative predictor of transfer degradation. Cross-repository AUC ranges 0.817–0.913 with mean 0.867; cross F1 ranges 0.525–0.707 with mean 0.631. The cross-within F1 gap shrinks to 0.027 after the file_age_days correction.

What's explicitly out of scope. No 20-pair pairwise matrix, no MMD distance analysis, and no domain-adaptation methods (TCA / CORAL / DANN). These are signposted as future work rather than claimed.

Dataset DOI →

PAPER C Risk-Based Software Test Prioritization via Machine-Learning Defect Prediction: An Empirical Top-k Coverage Study and CI/CD Integration Pattern

Status: Under Review Venue: IEEE Software Manuscript ID: SW-2026-05-0236 Submitted: 18 May 2026 ORCID: 0009-0004-1192-6906

Abstract. Top-k coverage analysis at k ∈ {10, 20, 30, 40, 50}% with risk-vs-random lift factors. Top 10% of files capture 43.82% of defects (4.37× lift, 81.4% of the oracle ceiling at the 18.61% base rate). Top 20% reaches 69.62%, top 30% reaches 83.64%. The production model gb-paper1-v4-fixed_age is integrated as a FastAPI microservice triggered by GitHub Actions with explicit advisory-vs-gating rollout criteria.

Honest disclosure. No production telemetry is yet instrumented; the paper describes the design pattern and the offline coverage substrate. A telemetry-grounded follow-up is in progress.

Dataset DOI →

PAPER D Self-Healing Test Automation via Tree-Ensemble Locator Ranking: An Empirical Evaluation on 2,400 Mutation Events Across Seven Refactoring Classes

Status: Under Review Venue: Wiley STVR — Software Testing, Verification & Reliability (Q2, IF 1.14) Manuscript ID: 1820539 Submitted: 14 Jun 2026 Expected decision: Aug–Sep 2026

Abstract. A framework that automatically recovers broken Selenium locators at runtime using DOM similarity analysis and tree-ensemble ranking over eight feature families (id-stability, role/aria, structural neighbourhood, text proximity, depth, attribute Jaccard, sibling-index, and Xpath-edit distance). Mutation-based evaluation pipeline simulates realistic UI evolution: 2,400 mutation events spanning seven refactoring classes.

Hybrid heuristic + ML ranking produces a deterministic top-1 candidate plus calibrated confidence used to gate auto-heal vs. flag-for-review.

Dataset DOI →

PAPER E LLM-Based Test Case Generation from Natural-Language Requirements: A Verified Multi-Domain Empirical Study with Symbolic Mutation Indicators

Status: Under Review · Editor assigned Venue: Springer Automated Software Engineering (Q2, IF 3.1) Submission ID: 3a23f22d-7d70-4d5e-8082-a6ce82c889c9 (manuscript ID pending) Submitted: 17 Jun 2026 (editor assigned 17 Jun 2026)

Abstract. The RAITG framework converts 312 requirements across three domains (banking, healthcare, e-commerce) into executable tests using an LLM plus deterministic rule-based verification. End-to-end results: 68% effort reduction (184h → 58.9h), requirement coverage 71.2% → 94.1%, and first-pass verification rate 96.3%. Symbolic mutation indicators are used as a domain-agnostic adequacy proxy.

Preprint (Research Square — not peer-reviewed) → RAITG corpus DOI →

PAPER F Post-Execution Defect Attribution: An Empirical Comparison of Four Methods on 6,000 Synthetic Failure Events

Status: Under Review · Editor assigned Venue: Springer EMSE — Empirical Software Engineering (Q1, IF 3.6) Manuscript ID: EMSE-S-26-00920 Submitted: 16 Jun 2026 (editor assigned 17 Jun 2026) Expected decision: Aug–Nov 2026

Abstract. An empirical comparison of four post-execution defect-attribution methods on 6,000 synthetic failure events, fusing dynamic test-failure signals (stack traces, failure clustering, suspect-set ranking) with repository priors from the defect-prediction model to produce actionable triage advice rather than ranked file lists.

Preprint (Research Square — not peer-reviewed) → Archival DOI →

bdd2pw bdd2pw: Live-DOM Page Object Scaffolding from Gherkin Specifications via the Microsoft Playwright MCP

Status: Under Review Venue: Elsevier SoftwareX (Q2, IF 3.4) Manuscript ID: SOFTX-S-26-00858 Submitted: 31 May 2026 Expected decision: Aug–Sep 2026

Summary. A tool paper for an npm package that scaffolds Playwright page objects from Gherkin specifications by driving the Microsoft Playwright MCP against the live DOM. A preprint is hosted on SSRN (preprint hosting; not peer-reviewed). The package is published as @vijaypjavvadi/bdd2pw (v3.8.1).

SSRN preprint (not peer-reviewed) → Software DOI → npm package →

sel2pw sel2pw: A Deterministic, AST-Based Migration Toolkit from Selenium Test Suites to Playwright TypeScript

Status: Under Review Venue: Elsevier Software Impacts (Q3) Manuscript ID: SIMPAC-D-26-00122 Submitted: 31 May 2026 Expected decision: Jul–Aug 2026

Summary. A tool paper for a deterministic, AST-based migration toolkit that converts Selenium test suites to Playwright TypeScript. Validated on 409 Java files with 0 conversion failures. Published as @vijaypjavvadi/sel2pw (v1.0.5). An SSRN preprint relay is pending.

Software DOI → npm package →

PAPER G1 Detecting Employment-Experience Fraud at Scale: A Deterministic Timeline Engine and Adversarial AI-Fake Robustness Study

Status: Submission-ready Venue: Taylor & Francis — Information Security Journal: A Global Perspective (UISS20) Manuscript ID: pending (assigned at submission) Target submission: late June 2026

Summary. A deterministic timeline engine for detecting employment-experience fraud at scale, paired with an adversarial robustness study against AI-generated fakes. The reproducibility bundle (TruthHire) provides the reference implementation, evaluation harness, and a public-corpus assessment (CC-BY-4.0).

TruthHire reproducibility DOI →