Ellamind on synthetic data generation with distilabel for pipelining and LLM finetuning