data-driven newsvendor, LLM-based feature discovery, semantic-to-source mapping, fractile adjustment, conditional value-at-risk