denistol/media_parser

Media Parser

Media Parser scrapes news from popular Ukrainian news websites. It extracts headlines and their links, using CSS selectors to locate them in each page's structure.

Features

  • News scraping from multiple Ukrainian sources.
  • Easy configuration using CSS selectors for element extraction.
  • Support for popular Ukrainian news sites such as 5 Kanal, Ukrainska Pravda, TSN, RBK-Ukraine, and others.
  • Suitable for further data processing or integration into other projects.

Supported Sources

The project supports the following news websites:

Example Object for Scraping:

{
    "url": "https://www.5.ua/",
    "name": "5 Kanal",
    "container_selector": ".nf-latestnews",
    "item_selector": ".nf-latestnews-post",
    "title_selector": ".nf-latestnews-post-lnk"
}
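
As a rough illustration of how an object like the one above might drive extraction, here is a minimal Python sketch. It is not the project's actual code: the names `HeadlineParser` and `extract_headlines` are hypothetical, and it uses only the standard library. It assumes each selector is a single class selector (such as `.nf-latestnews`), that items are nested inside the container, and that the title element is the link itself and carries the `href`. Downloading the page is left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Tags without a closing tag; they must not affect the nesting stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}

# Source object copied from the example above.
SOURCE = {
    "url": "https://www.5.ua/",
    "name": "5 Kanal",
    "container_selector": ".nf-latestnews",
    "item_selector": ".nf-latestnews-post",
    "title_selector": ".nf-latestnews-post-lnk",
}


class HeadlineParser(HTMLParser):
    """Hypothetical helper: collects title/link pairs via single-class selectors."""

    def __init__(self, source):
        super().__init__()
        self.base_url = source["url"]
        # Assumption: every selector has the form ".class-name".
        self.container_cls = source["container_selector"].lstrip(".")
        self.item_cls = source["item_selector"].lstrip(".")
        self.title_cls = source["title_selector"].lstrip(".")
        self.stack = []      # (tag, role) for each open element
        self.current = None  # headline currently being assembled
        self.results = []

    def _inside(self, role):
        return any(r == role for _, r in self.stack)

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        role = None
        if self._inside("title"):
            pass  # markup nested inside a title contributes text only
        elif self._inside("item"):
            if self.title_cls in classes:
                role = "title"
                link = urljoin(self.base_url, attrs.get("href") or "")
                self.current = {"title": "", "link": link}
        elif self._inside("container"):
            if self.item_cls in classes:
                role = "item"
        elif self.container_cls in classes:
            role = "container"
        self.stack.append((tag, role))

    def handle_data(self, data):
        if self._inside("title"):
            self.current["title"] += data

    def handle_endtag(self, tag):
        if tag not in [t for t, _ in self.stack]:
            return  # stray or void closing tag
        # Pop until the matching tag, so unclosed elements are tolerated.
        while self.stack:
            open_tag, role = self.stack.pop()
            if role == "title" and self.current is not None:
                # Collapse whitespace in the collected headline text.
                self.current["title"] = " ".join(self.current["title"].split())
                self.results.append(self.current)
                self.current = None
            if open_tag == tag:
                break


def extract_headlines(html, source):
    """Return a list of {"title", "link"} dicts for one source object."""
    parser = HeadlineParser(source)
    parser.feed(html)
    parser.close()
    return parser.results
```

Calling `extract_headlines(page_html, SOURCE)` on an already downloaded page would return one dictionary per matched headline, with relative links resolved against the source's `url`. Selectors more complex than a single class would need a full CSS selector engine, which this sketch deliberately does not attempt.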
