Unexpected behaviour with empty lines and Prism

RuslanI · October 3, 2021, 11:32am

In my markdown file I have included prism js and css files that I downloaded from Prism website. I have specifically included the “Normalize Whitespace” plugin.
If I don’t have any white spaces in the code block, everything works fine, like this:

<pre class="language-python line-numbers"><code>df_employees.loc['478'] # returns series
df_employees.loc[['478']] # returns dataframe</code></pre>

And it’s properly rendered in the final markdown like this:

Screen Shot 2021-10-03 at 18.27.40

But if I add an empty space in the code block

<pre class="language-python line-numbers"><code>df_employees.loc['478'] # returns series

df_employees.loc[['478']] # returns dataframe</code></pre>

then the text gets messed up unpredictably. It can be like this:

Screen Shot 2021-10-03 at 18.29.37

or like this in the issue I raised in their repo on github

github.com/PrismJS/prism

Empty line inside code block garbles the output

opened 11:24AM - 26 Sep 21 UTC

ruslaniv

cannot reproduce

**Information:** - Prism version: latest from the website - Plugins: line-hi…ghlight line-numbers autolinker show-language previewers normalize-whitespace toolbar copy-to-clipboard - Environment: Built in browser in DevonThink 3.7.2 **Description** If I add an empty line inside a code block then the output text gets garbled. It is still readable but it is shifted downward and line numbers are out of place. The screenshot shows the exact behavior. If I remove an empty line or put a character in it, then the code is rendered as it is supposed to. **Example** 1. Code block with an empty line ``` <pre class="language-python line-numbers"><code> def quality(total_bill,tip): if tip/total_bill > 0.25: return "Generous" else: return "Other" </code></pre> ``` That is how it is rendered ![Screen Shot 2021-09-26 at 18 19 23](https://user-images.githubusercontent.com/7631314/134805501-87b57634-7021-4093-a038-621a2385157e.png) 2. Code block without an empty line ``` <pre class="language-python line-numbers"><code> def quality(total_bill,tip): if tip/total_bill > 0.25: return "Generous" # else: return "Other" </code></pre> ``` Screenshot ![Screen Shot 2021-09-26 at 18 20 56](https://user-images.githubusercontent.com/7631314/134805548-c4c316ec-f122-4b68-ae3d-069e3d52d1e6.png)

Is there anything I can do to troubleshoot the issue, since having empty spaces in the code blocks would be really helpfull for me.

chrillek · October 3, 2021, 12:55pm

If I understand you correctly, you’re not using markdown but HTML pre elements (in your markdown) to render code.

Which means that your question is not related to markdown nor to DT. It is about the role and behavior of the pre elements in HTML.
In your case, instead of showing screenshots of the HTML, you might want to look at it in the developer tools of your browser.
As the other person noted on Github, something is off in your example. But nobody can figure out what without more context.

RuslanI · October 3, 2021, 2:42pm

But there is no browser involved. That’s what I see inside DT, after I created a new markdown file, put this at the beginning of the file:

<link href="/Assets/prism.css" rel="stylesheet">
<link href="/Assets/prismCustom.css" rel="stylesheet">
<script src="/Assets/prism.js"></script>

then entered the block of text / code shown in my previous example and then clicked
View → Document Display → Preview

chrillek · October 3, 2021, 3:11pm

There’s at least a part of a browser involved, you just do not see it: it is the built-in Webview of DT. Convert your MD to HTML in DT and open that in a browser.

Then you can use the developer tools to see

what the generated HTML looks like
which styles from which style sheets determine the look of it

And, seriously: I would not take the detour of MD if I only wanted to use HTML. MD can not do everything that HTML can, but displaying code blocks it certainly can. Just put them in three backticks
```python (or whatever language your code is written in)
code goes here
```
Mixing MD and HTML unneccessarily is adding more complexity.

RuslanI · November 5, 2021, 12:36pm

Well, I really like the visual styling that Prism provides - first of all syntax highlighting, of course, then line numbers, copy button, etc.

And it looks like I found the culprit, thanks to your advice to convert MD to HTML.

Here is the Source view of my MD document:

<pre><code class="language-sql line-numbers">
SELECT * FROM customers

LEFT JOIN orders
    ON customers.id = orders.customer_id;
</code></pre>

and that’s how it was rendered in the Preview view:
Screen Shot 2021-11-05 at 19.27.50

Having converted it to HTML, here is the way this code block was converted:

SELECT * FROM customers

<p>LEFT JOIN orders
ON customers.id = orders.customer_id;
</code></pre></p>

as you can see, DT (i think) is inserting a pair of P tags.

Once these P tags are removed, then finally DT displays the code block as it is supposed to:
Screen Shot 2021-11-05 at 19.33.40

Now I need to find out why these P tags are inserted and what is inserting them.

cgrunenberg · November 5, 2021, 12:48pm

How does the source of the Markdown document look like?

Bernardo_V · November 5, 2021, 1:15pm

This is related to MMD and not Prism or DT, I think.

RuslanI · November 5, 2021, 1:23pm

Here is the complete Source view of the markdown file:

<link href="/Assets/prism.css" rel="stylesheet">
<script src="/Assets/prism.js"></script>
<link href="/Assets/prismCustom.css" rel="stylesheet">

<pre><code class="language-sql line-numbers">
SELECT * FROM customers
LEFT JOIN 
	
	orders ON customers.id = orders.customer_id
</code></pre>

The first two prism files are downloaded directly from prism web site (syntax highlighting, line-numbers plugin, etc), the third one is just my custom CSS with styling i like (kind of my own Dark theme).

chrillek · November 5, 2021, 1:23pm

Why do you use HTML instead of code fences?

RuslanI · November 5, 2021, 1:28pm

I want syntax highlighting and line numbering that Prims provides and for that I have to wrap code in these tags.

RuslanI · November 6, 2021, 7:17am

You mean that since in MMD a return character designates a paragraph break, that’s the reason the P tags get injected into the HTML file whenever there is a return character?
That does make sense, but I wonder if there is way around it - I mean using MD and Prism in DT the way I would like?

chrillek · November 6, 2021, 12:47pm

It all depends on the Markdown parser used by DT. I have a hunch that it might be Multimarkdown 6, and if that one decides to add p elements like you see here – well, that’s that. You could use a <br> instead of a newline, though – that works.

On the other hand, I’d vote for using fenced code blocks instead, because they are easier to read and recognize and require less markup, so less risk of error. It would be possible to use line numbers in this case with one of the following methods

manually (ok, just kidding)
use a tiny JavaScript that takes care of it. It is actually trivial to insert into the MD document, but unfortunately it does not work right now due to a bug in DT’s handling of user-defined JS scripts. What you can do however, is to add the script to the source code of every MD file where you need it like so
<script>document.addEventListener('DOMContentLoaded', () => { const code = document.getElementsByTagName('code'); for (let c of code) { c.classList.add('line-numbers') } });</script>
make the MD processor pick up more than just the language string following the code fence, something like ```language-sql line-numbers or
```language-sql {line-numbers} or whatever – there are tons of suggestions out there.

The last point illustrates the main problem: There is nothing like MD proper. There are a ton of dialects now, and one would have to decide which one to choose. Apparently, DT went with Multi Markdown, which is as underspecified as any of the other dialects. To put it more diplomatic: There does not seem to be a formal grammar for any of the current MD dialects that would allow users to figure out how they behave in which circumstances.