NiFi processors catalog
This page is a working inventory of custom processors and processor scripts shipped in this repository.
Scope:
Python extension processors (
FlowFileTransform) innifi/user_python_extensions/script-based processors used with
ExecuteStreamCommandinnifi/user_scripts/processors/
Python extension processors
Processor class |
Source file |
Purpose |
Key properties |
|---|---|---|---|
|
|
Rewrites Avro binary field(s) to base64 string values and updates output Avro schema accordingly. |
|
|
|
Remaps JSON records using a mapping schema file, including nested and composite field handling. |
|
|
|
Extracts numeric IDs from JSON records and stores them in FlowFile attributes ( |
|
|
|
Converts Parquet FlowFile content to NDJSON output. |
None |
|
|
Normalizes OCR/MedCAT service responses into a consistent JSON output shape. |
|
|
|
Prepares records for NLP service requests as |
|
|
|
Prepares records for OCR service requests with |
|
|
|
Adds geolocation ( |
|
|
|
Reassembles ordered blob fragments and decompresses Cerner LZW payloads. |
|
|
|
Reference/sample processor template for implementing new processors. |
|
Script-based processors (ExecuteStreamCommand)
Script |
Source file |
Purpose |
Typical arguments |
|---|---|---|---|
|
|
Cleans PII-like patterns from text fields in JSON records. |
|
|
|
Converts Parquet bytes from stdin to NDJSON on stdout. |
None |
|
|
Reassembles + decompresses Cerner blob parts and emits merged JSON record. |
|
|
|
Reads files (and optional CSV metadata) from storage folders and emits JSON records for ingestion. |
|
|
|
Adds random geolocation points for records using configured city polygons. |
|
|
|
Cohort export utility; builds cohort aggregation artifacts from patient and annotation files. |
|
|
|
Test/dummy cohort data generator from patient and annotation input files. |
input file paths and patient/annotation field names |
|
|
Experimental schema conversion helper for Elasticsearch mappings. |
|
Notes
Some scripts in
nifi/user_scripts/processors/are stream processors; others are batch-style utilities.sample_processor.pyis a template/reference implementation, not a production flow processor.Keep this page updated when adding/removing processor files so developers can discover what is available.