What would a strict version of Haskell look like? That is, a pure language with syntax as close to Haskell as possible but where function application and data constructors are evaluated strictly. Laziness should be accessible through an explicit thunk datatype. (Nikita Volkov has written a good explanation of the explicit thunk datatype.)
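For concreteness, here is a minimal sketch, in today’s Haskell, of what such a type could look like. The names Thunk, delay and force are illustrative rather than Volkov’s actual API, and a real implementation would memoise the forced result, which this one does not.

```haskell
-- A minimal, non-memoising model of an explicit thunk type.
newtype Thunk a = Thunk (() -> a)

-- Suspend a computation without running it.
delay :: (() -> a) -> Thunk a
delay = Thunk

-- Run the suspended computation to get its value.
force :: Thunk a -> a
force (Thunk f) = f ()
```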
There are a lot of implicit thunks floating around in Haskell. It is anathema to treat A -> IO B as if it were simply A -> B, so it seems odd for A to mean a type of computations (returning values of type A) rather than itself a type of values. Strict Haskell would be more honest, since thunks would be represented by an explicit type.

The need for newtype is symptomatic of the confusion introduced by implicit thunks.
The need for evaluate :: a -> IO a is a wart which indicates that we do not understand how to mix IO and forcing thunks.

When we have a function which releases some memory, how can we ensure that the release happens before the rest of the program continues execution? In the pure case it seems possible to use CPS, and in IO it seems possible to use evaluate (and probably the CPS form too), but is there a unified, principled way? It seems unlikely that we will find one whilst thunks are implicit.
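For instance, evaluate at least lets us pin down when a pure result is computed relative to the surrounding IO; summarise and the sizes below are only stand-ins.

```haskell
import Control.Exception (evaluate)

-- A pure function whose result we want fully computed before anything else
-- happens, so that the large input it retains can be released.
summarise :: [Int] -> Int
summarise = sum

main :: IO ()
main = do
  let big = [1 .. 1000000 :: Int]
  -- evaluate forces the summary now, rather than leaving a thunk that keeps
  -- the whole list alive while the rest of the program runs.
  total <- evaluate (summarise big)
  print total
```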
Lots of libraries (Lens, ST, Map, ByteString, Text) have strict and lazy versions of functions or data structures. Lots of monads have strict and lazy versions. This is very confusing. Is it really necessary?
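Data.Map is the canonical example: Data.Map.Lazy and Data.Map.Strict export the same names, and the only difference is whether the stored values are forced. A sketch of the difference (the numbers are arbitrary):

```haskell
import qualified Data.Map.Lazy   as ML
import qualified Data.Map.Strict as MS
import Data.List (foldl')

keys :: [Int]
keys = map (`mod` 10) [1 .. 100000]

-- With the lazy module each stored count becomes a chain of (+) thunks;
-- with the strict module it is forced on every insert.
lazyCounts :: ML.Map Int Int
lazyCounts = foldl' (\m k -> ML.insertWith (+) k 1 m) ML.empty keys

strictCounts :: MS.Map Int Int
strictCounts = foldl' (\m k -> MS.insertWith (+) k 1 m) MS.empty keys
```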
For a Functor f, a value x :: f a often contains one or several thunks of type a. However, there is no general way to force these thunks. If we apply a large number of fmaps to x, then even if we force x it doesn’t mean the thunks inside have been forced. We can experience a space leak this way. This demonstrates that seq is implicitly part of the interface of every datatype, but it is generally not treated that way by datatype authors.
foldl is the classic example: its lazy accumulator builds up a chain of thunks that nothing forces until the result is finally demanded.
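A small sketch of both problems:

```haskell
-- Forcing x with seq exposes the Just constructor, but the Int inside is
-- still a chain of a million (+ 1) thunks built up by fmap.
leakyMaybe :: Maybe Int
leakyMaybe =
  let x = iterate (fmap (+ 1)) (Just 0) !! 1000000
  in x `seq` x

-- The classic lazy foldl never forces its accumulator, so it builds the
-- thunk (((0 + 1) + 2) + ...) before anything is added up.
leakySum :: Int
leakySum = foldl (+) 0 [1 .. 1000000]
```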
Note that monadic bind does not have an obvious drawback in a strict regime. With m a -> (a -> m b) -> m b, the m b need not actually run anything if the m a short circuits, because it’s already hidden behind a function (imagine Maybe for a prototypical case).
The Applicative instances will potentially be more problematic. With (<*>) :: f (a -> b) -> f a -> f b it will be hard (syntactically, at the very least) to stop the f a computation from running, even if the f (a -> b) short circuits. It will also only make sense to use lazy Traversable instances. Mapping an Applicative-value-producing function over an entire Traversable is a waste when it is followed by a sequence which can short circuit. Perhaps the true nature of (<*>) is LazyTuple (f (a -> b), f a) -> f b.
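Here is a sketch of that idea for Maybe, modelling LazyTuple as nothing more than an explicitly thunked second argument (the Thunk type is the illustrative one from above, not an existing API):

```haskell
newtype Thunk a = Thunk (() -> a)

force :: Thunk a -> a
force (Thunk f) = f ()

-- A strict-language-friendly (<*>) for Maybe: the second computation is
-- suspended, so a Nothing on the left never runs it.
apMaybe :: Maybe (a -> b) -> Thunk (Maybe a) -> Maybe b
apMaybe Nothing  _  = Nothing
apMaybe (Just f) tx = fmap f (force tx)
```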
Is Haskell’s IO itself lazy out of necessity? Maybe. Consider a pure function containing an expression let x = f y :: IO A. Then x is an IO action ready to be run, but not run yet! IO certainly contains some sort of delaying, and I don’t understand the importance of this to the whole strict Haskell issue.
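The flavour of this can be seen in ordinary Haskell already: binding an IO action to a name constructs a value describing the effect, and nothing runs until that value is reached in the chain of actions.

```haskell
main :: IO ()
main = do
  let x = putStrLn "hello" :: IO ()  -- an IO action, built but not yet run
  putStrLn "before"
  x                                  -- executed here
  x                                  -- and executed again here
```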
Would this language be worse than Haskell? Hughes talks about the importance of laziness for modularity but I think that the issues he raises only actually apply to lazy data structures.
Augustsson raises some important points, though a lot of his examples are based on the behaviour of error, which in my opinion is just wrong. error is an effect and has no place in pure code.
The “error” example is just wrong. The \a -> a + expensive example can be solved in a strict language through an explicit thunk data structure.

Augustsson’s case for lazy functions is a harder one to answer. Again he levels the false criticism of error, but the point still stands. I think his idea of using {exp} for a thunk that, when forced, evaluates exp is a good one.
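A rough sketch of how the expensive example might look with such a thunk, written in today’s Haskell just to show the shape; veryExpensive is a stand-in, and a real thunk type would memoise where this model recomputes.

```haskell
newtype Thunk a = Thunk (() -> a)

delay :: (() -> a) -> Thunk a
delay = Thunk

force :: Thunk a -> a
force (Thunk f) = f ()

-- In a strict language an eager binding of `expensive` would compute it even
-- if the returned function were never applied; suspending it explicitly
-- recovers what laziness gives for free (modulo memoisation).
makeAdder :: Int -> (Int -> Int)
makeAdder y =
  let expensive = delay (\() -> veryExpensive y)
  in \a -> a + force expensive
  where
    veryExpensive n = sum [1 .. n * 1000000]  -- stand-in for real work
```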
I don’t understand Augustsson’s point about lazy constructors. Does “lazy constructor” mean “lazy datatype”?
I think these are solvable with explicit thunks.
I don’t understand Augustsson’s point here at all. His “biggest gripe” is about composability, but it seems trivially solvable with lazy data structures. He does seem to realise this: in the comments he says “It doesn’t matter if ‘or’ and ‘map’ are overloaded for different kinds of collections, it’s still wrong for strict collections”.
Bob Harper makes an interesting point in the comments:
‘In my mind however the conditional branch is not an example of [laziness]. The expression “if e then e1 else e2” is short-hand for a case analysis, which binds a variable (of unit type) in each branch. So the conditional is not “lazy” at all; rather, it is an instance of the general rule that we do not evaluate under binders (ie, evaluate only closed code).’
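One way to make that reading concrete is to write the branches as functions from unit, so the only thing evaluated up front is the scrutinee:

```haskell
-- Harper's reading made explicit: each branch binds a (unit) argument, and we
-- never evaluate under a binder, so neither branch runs until it is chosen.
conditional :: Bool -> (() -> a) -> (() -> a) -> a
conditional c e1 e2 =
  case c of
    True  -> e1 ()
    False -> e2 ()
```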
Edward Kmett raises some good points and some that I don’t understand. I think one thing that he is hinting at is that in m >>= f the tail call is not f but rather (>>=). Thus if m is evaluated strictly (in our setup, that corresponds to “not wrapped in a thunk”) and is a large chain of calls of the same structure, then a lot of stack space will be consumed. (Kmett mentions Traversable rather than a chain of monadic binds, but I suspect it’s the same issue.)
Kmett later reiterated the issue and mentioned similar ones. See the examples starting with
I don’t really understand how laziness (or probably more accurately, non-strictness) is relevant to equational reasoning. I would like to find some explicit examples. Perhaps “Theorems for Free”?
I really don’t understand the relevance of this at all.
This is basically just lazy bindings. I get why it’s good, but would hope to find a syntax which makes it less relevant.
This is a lazy data structures only comment, as far as I can tell.
This is exactly the same as Augustsson’s point above.
Kmett claims monads only work with laziness. I don’t understand this.
The “Being Lazy With Class” paper mentions two important uses of laziness:
Don Syme notes in “Initializing Mutually Referential Abstract Objects: The Value Recursion Challenge” that “Wadler et al. describe techniques to add on-demand computations to strict languages.”
A CUFP paper notes that Mu has demonstrated strictness to be harmful to modularity.