Why Netscape shows ? instead of ' 104
RandySC writes " Demoronizer
is a Perl program which corrects numerous errors and
incompatibilities in HTML generated by, or edited with,
Microsoft applications. The demoroniser keeps you from
looking dumber than a bag of dirt when your Web page is
viewed by a user on a non-Microsoft platform.
A little detective work revealed that, as is usually the case when you encounter something shoddy in the vicinity of a
computer, Microsoft incompetence and gratuitous incompatibility were to blame. Western language HTML documents are
written in the ISO 8859-1 Latin-1 character set, with a specified set of escapes for special characters. Blithely ignoring this
prescription, as usual, Microsoft use their own "extension" to Latin-1, in which a variety of characters which do not appear in
Latin-1 are inserted in the range 0x82 through 0x95--this having the merit of being incompatible with both Latin-1 and
Unicode, which reserve this region for additional control characters.
" So now we know what happened to Jon.
Accept only correct HTML on your WebServer! (Score:1)
Yes! (Score:1)
Good god, that is lame (Score:1)
Yes, including your post.
No quotes??? (Score:1)
Netscape and IE bug only. (Score:1)
Perhaps somebody needs to fix this in mozilla.
That's not quite what the problem is (Score:1)
we don't need such a program (Score:1)
I remember reading an article a while back where Gates told the reporter that MS wasn't in the business of fixing bugs and handling bug reports..
"User's don't want bug fixes, they want new features"
This could be a hoax and all, but it seems valid enough given their BugFix Track record.
My Point is: Post your bug report, but download demoronizer anyway, because it's gonna be quite a wait.
JWitt
MS post a patch in a week????!! (Score:1)
1. Ignore it, hoping it goes away
2. Deny the existence of said bug
3. Promise a patch in the next release
4. Maybe fix, maybe not; definitely introduce new bugs
# find
find: cannot open
Cold hard facts make MS look bad. (Score:1)
Bad examples. (Score:1)
But, you belittle slashdotters in so saying, by assuming that we are idiots who don't already know this simple fact. Thus you are not following your own advice.
And your examples were terrible. ODBC?! Puh-lease! ODBC is very hard to use compared to the SQL it is trying to supplant. It is *not* an example of an improvement.
And why on earth do you assume that a world in which the linux machines are only on the server and the clients are all Windows is a good goal? This is not a worthy goal. We can already do that today. Allowing Linux on the clients is a good goal. Making MS coexist with Linux is also an impossible goal unless we can raise awareness of MS's incompatabilities so the public no longer tolarates them. Getting Windows to play nice with other platforms is essentially what this article was about, if you will recall.
In an ideal world, the choice of what OS to have on the desktop would *not* have to be dictated from on high just to get stuff to be compatable. I have no problem with systems where the users *can* use Windows and most end up choosing to use Windows. I have a problem with systems where the users have no choice but to use Windows in order to work with their Windows-using co-workers. (This is also why I don't like the propsed network for the ISS - allowing Windows is one thing, but forcing it on people for artificial reasons is another entirely.)
Isn't this a little partisan? (Score:1)
Having said that, maybe RandySC's bon mots are a wee bit OTT, but if you've experienced any of today's Threads From Hell [slashdot.org] (or read the non-OTT part of his submission), maybe you can excuse someone's MS-Disgust-o-Meter® running in the red.
Disclaimer: I'm asleep.
Rants Are Optional
--
It looks awful... (Score:1)
Craig
Blame W3C, too... (Score:2)
But the HTML spec has a glaring lack that motivates this violation in the first place: no curved quotes and apostrophes, and no em-dash.
Now, HTML is supposed to display by default in a proportional font, like printed matter (it's easier to read, among other advantages). But proportional fonts always use curved, symmetric double and single quotes.
Likewise proportional fonts always distinguish between a hyphen and a dash; most, in fact, have two dashes (the endash and the emdash) of slightly different widths, in addition to the hyphen.
But the HTML spec (and ISO8859-1) assumes the broken ASCII/Typewriter usage, which in proportional fonts is jarring and ugly. Font specs should be designed by people who know something about fonts, not by engineers!
The situation is potentially worse in other languages, though I'm not sure how the other ISO-8859-x specs handle it. In German, for example, the opening double and single quotes are traditionally at the bottom of the print line rather than the top, in addition to being reverse-curved, and French uses "guillaumettes", which look like doubled marks.
Search the Web for things like ampersand-emdash-semicolon and ampersand-lquot-semicolon -- which are attempts to address the problem -- and you'll see that this gaping mistake in HTML/ISO8859-1 bothers a lot of people.
So yeah, blame Microsoft for a kluge that works on only maybe four out of five of the web-surfing PCs out there. But complain to the ISO and to W3C for their oversight, too.
Craig
Good god, that is lame (Score:1)
The Windows-1004 codepage -- not on Linux either (Score:1)
-----
Not true at all (Score:1)
--
Poor M$ HTML!=Perl (Score:1)
Daniel
Convert to a mod_perl script, and use it on /. (Score:1)
Now, most of the HTML produced by Rob's slash scripts is pretty good, but there are a few complaints I have (basically, most of the page winds up on one bigass-long line, making debugging the HTML code (as when I was trying to figure out which Netscape bug the "ask slashdot" header triggers) annoying in the extreme) that this seems to clear up - it doesn't just go through and convert windows-specific quotation characters to standards-based equivalents; among other things, it will wrap HTML lines so that you can read the code later.
Actually, if some enhancements were made to this (say, applying a standard sorting to text-level markup and eliminating consecutive open/close tags), it could be used as a rather nice HTML cleaner; for example, getting something that turns:
This is a test
into:
This is a test
Or even something as awful as:
This is a test
would be very nice - I've noticed many odd tag-ordering problems in all sorts of auto-generated HTML. Turning crufty HTML into something that renders identically but will pass a weblint test would be very nice indeed.
No it doesn't. (Score:1)
<P><UL>This text is in a paragraph.
</UL></P>
becomes:
<p>This text is in a paragraph.
It leaves other occurences of </P> alone. (well, it also removes empty <P></P> paragraphs entirely) I agree, though, that it probably shouldn't be removing the </P> in this case; as other people have pointed out, it's explicitly not required in HTML 4.0 or 3.2, but I think that in the 3.0 draft (that all the browser vendors rejected because they thought it was too hard - such a shame too, because I was really looking forward to MathML) you are correct. Even if it was never required, </P> is a good idea.
Also, after looking at the actual script, I take back my comment about it being a good starting point for a general HTML-cleaner. Such an HTML cleaner is still a good idea, but it should probably be written from scratch.
Isn't this a little partisan? (Score:1)
" A little detective work revealed that, as is usually the case when you encounter something shoddy in the vicinity of a computer, Microsoft incompetence and gratuitous incompatibility were to blame. "
True or not I think this may be pushing an anti MS viewpoint too far. Nor that I support MS. Yeah, I mostly agree with the DOJ view and consider MS has legally stepped over the line. But a news portal ought to be a little more careful about presenting fair and consistent views from the editorial staff, and they ought to be as impartial as possible. This was nowhere near that goal.
More mal-rendered or mal-encoded HTML... (Score:1)
Barnes & Noble
instead of
Barnes & Noble
which seems to work ok, until you remove the space following the entity:
Barnes&Noble
is sometimes rendered as
Barnes&Noble
Who is to blame in this case?
PS. It sure is fun to describe HTML in HTML, especially when the slashdot code enters into the picture.
PPS: The following is encoded as Barnes&Noble. What does your browser do with it?
Barnes&Noble
--
Ohhh, I've wondered about this for a while (Score:1)
I've been wondering what the damn ? thing is. (Score:1)
I should have known Micro$loth would have something to do with it.
Broken (Score:1)
New filesystems, new encrypted passwords, new efforts to block *nix dns servers, incompatible fonts, office 97 file formats.
Linux would be less a threat to MS if they acted like part of the community instead of trying to grind it into the mud methinks.
distinguishing between the sinner and the sin (Score:1)
The Problem it is trying to solve: standard access to multiple databases so that a program can be configured for any particular database needed.
The MS solution: Design only the library interface and the SQL administration level and require custom, database specific drivers for each database a program has to talk to. Absolutely avoid creating a standard protocol since that would mean that anyone could write a non-proprietary driver. Charge money for ODBC drivers and encourage database vendors to do the same.
PLEASE! ARGHHHH.....
We are finally seeing this madness come to an end slowly, but it has been a nightmare! How hard do you have to make databases??? Login, send a query, get results and status messages, bind data to variables, close.
Message Oriented (MOM) or Message Passing (MPI, Voyager, etc.) with XML requests and results will soon obliterate the stupidity... I hope!
sdw
Prefer it in Mozilla... (Score:1)
On any platform.
(However if you try to use the locale settings on Microsoft platforms, I bet that the quotes in question are pushed on you by default. So it may not be so simple..)
Regards,
Ben Tilly
Bad journalism (Score:1)
I like the idea of an Apache module that fixes things on the way out.
distinguishing between the sinner and the sin (Score:1)
I think a lot of people here are confusing ODBC with SQL. ODBC isn't SQL. ODBC is a layer that allows you to talk with a variety of data sources - SQL or not. You can use it access a variety of data sources - Oracle, Sybase, Informix, SQL-Server, Access, FoxPro, etc. All of which provide for very different protocols, and flavors of SQL.
In the end, it provides you with an environment where you can be less concerned about the protocol and how you've linked to the data source - and less concerned about how you stucture your queries (in that they are not custom coded for it).
Sure, you take a performance hit. But, it works pretty good, and allows you to quickly and easily change the data source on the back end of your application. Makes it nice for prototyping an App using a local db schema defined in Access, and then for roll out actually pointing at a db in Oracle, or SQL server...
And - there are some ODBC drivers that are freely available.
- Porter
Ever hear of "embrace and extend"? (Score:1)
--
Bill stole our web. (Score:1)
God knows the net sure could use more regulation and laws, right?
</p> is not required (Score:1)
Actually </p> is (and always has been) optional. The HTML 4.0 [w3.org] spec says:
Isn't this a little partisan? (Score:1)
Isn't this a little partisan? (Score:1)
Screw news; Slashdot is entertainment.
How bout a version for apache's mod_proxy (Score:1)
Ex Machina "From the Machine"
xm@GeekMafia.dynip.com [http://GeekMafia.dynip.com/]
email LAME sites (Score:1)
New Term -- "Microsoft-Enhanced Superior Standard" (Score:1)
How about MESS?
(Microsoft-Enhanced Superior Standard)
Jay (=
Good god, that is lame (Score:1)
Isn't this a little partisan? (Score:1)
But absolutely accurate.
If the anti-microsoft bias of slashdot annoys you, go somewhere else. Pro-Microsoft news outlets may become more difficult to find as MS's behavior becomes more flagrant and public ("you reap what you sow, Billy Boy"), but right now they're a dime a dozen and would love to add your mouse clicks to their statistics.
distinguishing between the sinner and the sin (Score:1)
I would rather raise awareness of the limitations of Microsoft stuff even as I help people work with it. It's on their desks, they need to do work, and they need to exchange stuff with other people.
This is probably one of the larger points that most people seem to be overlooking. I'm currently contracting with a nationwide bank, that has over 30,000 employees. All evil-MSism's aside, the TCO (total cost of ownership (support, installation, training, NOT merely the cost of the software)) would be staggering. You cannot merely absorb such a cost because of the Microsoft bad, Linux good mentality.
Phred is taking the proper approach. Informing people, showing them that there are better, cheaper alternatives. Contrary to popular belief, people aren't all lemmings. Show them, don't berate them, they will learn and they will become better for it.
Ah yes, the point of the article... (Score:1)
Well, if you had bothered to read all of the subsequent posts as the person you refer to had, you would have realised that the discussion had, as per the norm here on /., turned into a MS-bashing and was no longer referring to the original topic.
You are just a weak imitation of your evil twin in that campaign - Jackass
This is too easy... but I will say that the simpleminded, when unable to defend an idea with intelligent conversation resort to such tactics thus undermining any respect that they would have garnered.
Dammit! (Score:1)
Isn't this a little partisan? (Score:1)
He of course doesn't have to impartial, PC or place whatever other form of blandness constraint upon the writings he publishes on his own Web site, don't you agree?
Regardless, this feature of Microsoftware is indeed brain damaged and gratuitously incompatible with the existing standards.
The Windows-1004 codepage. (Score:1)
Without further ado, here it is. If your system is using the Windows-1004 codepage, you should see the actual symbols. Otherwise, you'll see garbage.
é 130 - slightly shortened comma
â 131 - forte (looks like an italic `f')
ä 132 - double comma (like double quote; like `,,')
à 133 - elipsis [spelling?] (like `...')
å 134 - dagger (like a cross or a `t')
ç 135 - double dagger (has two crosses in it)
ê 136 - circumflex or caret (like `^')
ë 137 - perthousand (like a `%' but two circles on bottom)
è 138 - uppercase sh (an `S' with an upside down `^' on top of it)
ï 139 - less than (like `î 140 - uppercase osh (ligature of `O' and `E'; looks like `OE')
ì 141 - unused/unknown
Ä 142 - uppercase zh (a 'Z' with an upside down `^' on top of it)
Å 143 - unused/unknown
É 144 - unused/unknown
æ 145 - open single quote (like a ` on Unix systems)
Æ 146 - close single quote (like a ' on Unix systems)
ô 147 - open double quote (like a `` on Unix systems)
ö 148 - close double quote (like a '' on Unix systems)
ò 149 - dot; reminds me of 007 on IBM codepage 437 and friends@
ODBC is good? (Score:1)
On the other hand, ODBC does adhere to the SQL standard in terms of the actual query language, which is nice for a change. I would have expected Microsoft to use something like FoxPro's old query language.
That said, I do not like ODBC.
Even if you don't want to use xfstt... (Score:1)
xfstt also works nicely, and on Linux, it doesn't use very much memory, because it uses memory-mapped files efficiently.
My port of xfstt to OS/2 (which you can get from hobbes.nmsu.edu when my website is down) isn't as efficient because OS/2 has no easy mmap(4) facility.
TrueType fonts render so much more nicely than Adobe Type 1 fonts or Speedo fonts at low resolutions. Baskerville 8 point actually looks nice when done with TrueType. Of course, you only get this benefit if your TrueType fonts have hints in them--in other words, using a tool to convert from Type 1 to TrueType won't help. You can read more about ``hints'' on Microsoft's website; search for ``fonts''.
OxymoronsI a little (Score:1)
Woolly character set control (Score:1)
distinguishing between the sinner and the sin (Score:1)
basically a pretty satisfied user of NT 3.51 and only run 4.0 because it's needed for a lot of the
new apps. But *I* am the boss of my machine, not Microsoft.
What I observe is that people feel obligated to stick with Microsoft and the whole range of their software. They kind of know what they'll get --
a lot of fairly complete but complicated pieces that are difficult to manage and offer far more functionality in a "flat" undifferentiated space
than they really know what to do with.
They then start to discover all the little gotchas, like Microsoft's shall we say cavalier attitude about the TCP/IP stack, their sincere but misguided attempt to steer HTML and so on.
Actually, there is one thing Microsoft has done that I have few qualms with, and that is ODBC. Sure it's slow and not well documented. It's almost like Microsoft didn't really want it to be accepted so that they could clear the way for their various lunges at "distributed" "object" management (ho ho). But ODBC was designed by some good people at Redmond who played nice with the other kids, paid attention to existing standards and user preferences, and produced something that has joined CGI and Perl as a true Web standard.
Sure, this is like Sun sponsoring Tcl. A pretty good analogy, that.
I would rather raise awareness of the limitations of Microsoft stuff even as I help people work with it. It's on their desks, they need to do work, and they need to exchange stuff with other people.
I just let them find out that I was running Perl scripts to manage Novell servers in 1993, and that there is a whole world of other alternatives out there. If they don't want Linux on their desktop -- and I don't want it there yet either -- I can have them gracefully accept it off in the corner, serving their offices by being a good-neighbor fence and conduit for the net. Plus it lets me use all the nice post-1993 486 machines that are orphaned out there.
Microsoft is big and round and we can throw rocks at it and stamp our little feet, or we can work around them and help people get actual value from their computer systems -- a big anxiety out there about that, by the way, if you haven't noticed. At least, unlike ten years ago, people aren't putting them in the closet because they can't use them. Now they just get stuck when Win 95/98 crashes a lot and starts not running the programs they used to.
Eventually we might even have Linux file and program servers running Samba over Fast Ethernet
networks (which eliminate the perceived delay in transferring files across a normal office net), and thin Microsoft clients that have just enough
local stuff to boot up and do basic things.
Sure, Front Page is a dog, but people are using it successfully to create Web pages, and who am I to get in the way of that. We have a big educational process ahead of us, that's for sure. But we don't get anywhere by asking everyone to join our Microsoft Whine Club first.
Just show 'em how it's done. They'll figure it out.
--------
DeBillAlizer (Score:1)
"One computer, one language -except we dont use it at all."
I tell you what,
Maybe let MS stop using HTML.
They will use MSML which is only viewable with MS apps.
IE 5 will use it and HTML,
but if a website wants to be viewed with IE6, it will have to be MSML.
It will have "the best lisence they can offer": GPF
That means you can use it for free unless it presents a competition to them.
not only that,
the new IIS servers will check each file they are transfering to see if its HTML compatable.
If it is, it will automaticly make it into MSML, for the users to enjoy it better!
(and ofcoyrse the IIS server 7 times faster)
Next thing to do,
is to create a new language for people,
"MS-english" and they will talk in that only.
It wil be slower and sometimes people die when using it and have to reboot themselves.
Go Microsoft!
Make our lives easier by making new and improved standrats!!!
Bill stole our web. (Score:1)
We all use the MS compatible standards because everyone else does,
and that's why we're losing the internet to them.
They say HTML is now MS thing, we say ok, lets quickly change netscape to fit.
But HTML was ours a long time before MS had it.
We cannot afford to lose anything.
I see people say that without Microsoft there would be no Internet!
I see people thinking MS invented email!
People that tell me that HTML was introduced by IE!
We should not allow non-standard to win.
We need to enforce the standard and take our internet back.
Something such as a "warning: non standard page"
on every non standrd page should be required by law.
same as "This product does not generate HTML but our version of HTML" too.
Standard did not exist for Microsoft to ruin,
but for us to use.
Good idea! (Score:1)
please post it to freshmeat or anywhere,
i'd like to have it too...
Computers that _don't_ run Windows? Go awaaay... (Score:1)
Arial vs. Helvetica, the battle continues... (Score:1)
We're talking about stupid web authors who think the whole world is ruled by Microsoft and think that 'Arial' has existed since before time began and isn't just a rip-off of Helvetica to circumvent copyright-laws (like the whole True-Type stuff).
Netscape 3.0 substituted the current proportional font when it did't find the indicated on, but the 4.0 browsers just use Courier. Not quite nice, if you ask me...
There's got to be a way to tell Netscape what font to use, but I didn't find anything in the Netscape.ad file. I guess you could tell X to map 'Arial' to 'Helvetica', but I seriously don't know how...
Poor M$ HTML!=Perl (Score:1)
1. Stupid enough to be using an M$ app to generate their HTML in the first place.
2. Lazy enough not to worry about the bugs in it, and consequently their HTML.
...then, nice and accurate as it might be, I fear a Perl script won't be what they're looking for. A huggy, friendly, 4.5Mb Winbloze app with 15 options screens might just whet their appetite
A story from real life... (Score:1)
Computers that _don't_ run Windows? Go awaaay... (Score:1)
Isn't this a little partisan? (Score:1)
And why should they be impartial?
It is interesting to see that in the last few decades, it has been increasingly touted that news sources should be impartial. It first affected newspapers with close afiliations with political parties, which in large numbers have claimed independence, and claimed being impartial.
It has spread out to all kinds of news sources.
But a news source is NEVER impartial.
It always to some extent reflect whatever the owners, and their appointed editors want it to. No matter how much they try to pretend they're impartial, what they do will always have a slant depending on their view of the world.
And claiming to be impartial just makes the bias harder to see. In that respect, claiming to be impartial does more harm than it does good: it make a lot of people swallow biased news because they aren't aware of the political or commercial afiliations of the news source.
If you want to look for an impartial news source, why don't you search for the fountain of youth while you're at it - you're just as likely to find either.
nothing new (Score:1)
De-commoditized(tm) characters (Score:1)
- Bill G~++@_&^*&$3
NO CARRIER
Not just Microsoft (Score:1)
This will increase our effort to educate the public
Whenever I see "Web Tips" columns saying you can put in quotes and dashes in a web page without resorting to Unicode or GIF, I write a letter to the author. Unfortunately, last time I did this the author replied, promised a correction, but the correction never showed up in print.