π§π· π Hi Guys, for the past few weeks I'm trying to develop a rag model that can retrieve information from pdfs, but my first issue started with pdfs that contain charts and images. Does anyone knows how to extract all the data to a data frame using offline open source packages? And how can I retrieve this raw files together with the summarized embeddings?