23456785 of 42
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Learnable Cross-Modal Adapter for Industrial Fault Detection Using Pretrained Vision Models
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science. EnergyVille, Genk, 3600, Belgium.ORCID iD: 0000-0002-5229-1140
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0002-4390-411X
Blekinge Institute of Technology, Faculty of Computing, Department of Computer Science.ORCID iD: 0000-0002-6309-2892
Blekinge Institute of Technology, Faculty of Computing, Department of Software Engineering.ORCID iD: 0000-0001-9336-4361
Show others and affiliations
2026 (English)In: IEEE Transactions on Industrial Informatics, ISSN 1551-3203, E-ISSN 1941-0050Article in journal (Refereed) Epub ahead of print
Abstract [en]

Automatic fault detection and diagnosis (FDD) are critical for maintaining reliable and efficient industrial systems. However, conventional methods rely heavily on manual inspections or threshold-based techniques, which often fail to capture the dynamic patterns in time series (TS) sensor data. As a result, faults persist for extended periods, leading to suboptimal system operations, increased energy waste, and significant economic losses. This work proposes a cross-modal framework that facilitates the efficient deployment of state-of-the-art pretrained vision models for enhanced FDD, with two novel TS-to-image transformations: first, an adapter deep encoder that learns optimal, task-specific representations from raw sensor data while generating outputs that are input-compliant with pretrained models. Second, an enhanced line plot that creates geometric shapes of two related signals. Comparative experiments against fixed methods, including spectrograms, Gramian angular fields, Markov transition fields, recurrence plots, and five deep learning baseline models, showed substantial performance gains across diverse domains. InceptionTime achieved the highest average baseline performance with an F<inf>1</inf> of 88.6%, while the adapter and shapes achieved 94.4% and 92.4%, respectively. The findings highlight the potential of the cross-modal framework for FDD to facilitate early intervention and efficient system maintenance in industrial settings.

Place, publisher, year, edition, pages
IEEE Computer Society, 2026.
Keywords [en]
Cross-modal adaptation, deep learning, fault detection and diagnosis (FDD), pretrained vision models, time series (TS), transfer learning (TL)
National Category
Artificial Intelligence Industrial engineering and management
Identifiers
URN: urn:nbn:se:bth-29203DOI: 10.1109/TII.2026.3659264ISI: 001691144300001Scopus ID: 2-s2.0-105030196076OAI: oai:DiVA.org:bth-29203DiVA, id: diva2:2042282
Available from: 2026-02-27 Created: 2026-02-27 Last updated: 2026-02-27Bibliographically approved

Open Access in DiVA

fulltext(3451 kB)16 downloads
File information
File name FULLTEXT01.pdfFile size 3451 kBChecksum SHA-512
076021590c70a7bdee5742c2f0183a0a8c0dc401a0b71315872391f267c460ca7dfcf421089d3fe0a5c9f7ef16bb0af1783d99d404bdbf5405dbf3188172ada7
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

van Dreven, JonneCheddad, AbbasAlawadi, SadiGhazi, Ahmad Nauman

Search in DiVA

By author/editor
van Dreven, JonneCheddad, AbbasAlawadi, SadiGhazi, Ahmad Nauman
By organisation
Department of Computer ScienceDepartment of Software Engineering
In the same journal
IEEE Transactions on Industrial Informatics
Artificial IntelligenceIndustrial engineering and management

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 964 hits
23456785 of 42
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf