Building data processing pipelines for documents processing with NLP using Apache NiFi and related services