LaVOZs

The World’s Largest Online Community for Developers

'; Two adjacent fields rendered as one in XML with unrecognized separator — how can I split them in Google Sheets IMPORTXML? - LavOzs.Com

I'm attempting to scrape a site's list of news articles, capturing topic, headline, author, and date published into Google Sheets using IMPORTXML. I've got the first two, but the last two are assembled somewhat confusingly.

The website has a page where all of its stories are listed chronologically. In the source of that page, the author and date published are rendered thus within a div:

By <span class="post-item-river__byline___mU1tP author vcard"><a class="byline-link url fn n" href="https://www.fakeurlgoeshere.com">Author Name</a></span><time class="post-item-river__date___1Dcq1 entry-date published" datetime="20XX-XX-XXTXX:XX:XX-XX:XX">Date Published</time>

How this displays on site: Author Name·Date Published

How this displays when scraped in IMPORTXML: Author NameDate Published

I would like Author Name and Time Published to be recognized as separate fields. How do I accomplish this?

I have attempted multiple arguments, including trying numerous variants of div/time arguments, but those don't seem to have worked, with the output always returning 'Imported content is empty'.

Related
importXML in google spreadsheet two separated tags
'importxml' - importing XML won't work in Google Sheets
How to use importxml in google sheets to import item hidden by scroll into view
tsql how to split xml and insert them as rows
Google sheets Import XML (importxml) error
Google Sheets ImportXml() imported content can not be parsed
Trying to extract image URLs from XML using IMPORTXML in Google Sheet
How to pull “publisher” data from BGG - Importxml - google sheets
How to include a Google Sheets cell number in XPath for ImportXML
How to include a Google Sheets cell reference in URL for ImportXML