Submitted by Ggronne t3_10tbfjq in MachineLearning
Ggronne OP t1_j7aj3co wrote
Reply to comment by matth0x01 in Information Retrieval book recommendations? [D] by Ggronne
I have written small web scrapers for different applications, but none were based on theory. An upcoming project requires more extensive information retrieval and I would therefore like to get a better foundation.
I will start with Introduction to Information Retrieval, thanks!
I will start with Introduction to Information Retrieval; thanks!
matth0x01 t1_j7ayc9e wrote
Seems that you are more interested on the crawling and ETL side.
Maybe you should look more into Data warehouse or Data lake literatur. Especially the shift in paradigm from ETL (extract, transform, load) to ELT (extract, load, transform) respectively schema-on-read.
Ggronne OP t1_j7bg7r1 wrote
Thanks! Can you recommend any good resources for ELT (and ETL)?
matth0x01 t1_j7c3smm wrote
Sorry, my library seems a bit outdated on that side.
But the one from Wikipedia looks great at first sight. Ralph., Kimball (2004). The data warehouse ETL toolkit : practical techniques for extracting, cleaning, conforming, and delivering data
Viewing a single comment thread. View all comments