Kenneth Gitere
|
e5a318282d
|
Update img tags with new src values to point to the local files
|
2020-05-02 19:06:03 +03:00 |
|
Kenneth Gitere
|
78ba40f57a
|
Add image download functionality
|
2020-05-02 18:33:45 +03:00 |
|
Kenneth Gitere
|
f24e72e70f
|
Change signature of extract_content to copy the reference to article DOM
node instead of writing to file
|
2020-05-02 14:51:53 +03:00 |
|
Kenneth Gitere
|
529704d227
|
Add test for extract content
|
2020-05-01 20:42:41 +03:00 |
|
Kenneth Gitere
|
b5336e078d
|
Factor out text extraction into extractor module
|
2020-05-01 16:17:59 +03:00 |
|
Kenneth Gitere
|
4527fb07d9
|
Initial extraction code to get meta information on a blog
|
2020-04-30 11:05:53 +03:00 |
|