Re: Understanding Word and Sentence Boundaries

help-gnu-emacs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Understanding Word and Sentence Boundaries

From:	ken
Subject:	Re: Understanding Word and Sentence Boundaries
Date:	Tue, 11 Dec 2012 10:17:36 -0500
User-agent:	Mozilla/5.0 (X11; Linux i686; rv:10.0.11) Gecko/20121120 Thunderbird/10.0.11

On 12/11/2012 07:03 AM Eric Abrahamsen wrote:

ken<gebser@mousecar.com>  writes:

On 06/26/2010 11:05 PM Deniz Dogan wrote:

2010/6/27 ken<gebser@mousecar.com>:


On 06/26/2010 06:53 AM Paul Drummond wrote:

Thanks for the responses guys.

....

I presume that Emacs hackers either a) put up with it or b) spend a lot
of time fixing each case until they are happy.

I suspect the answer is b. ;-)

I wish there was a single minor-mode that fixes all the word boundary
issues for every major-mode I use!  I can but dream.   Or maybe I will
get round to doing it myself one day!  ;)

Cheers,
Paul Drummond



Is it possible to specify word boundaries for a particular mode?


Yes, it's part of the syntax table. See e.g. `modify-syntax-entry'.


Thanks for the pointer to that function.

The behavior I see in need of repair is the role of so-called "comments"
in sentence syntax.</tag>   For instance, immediately before this
sentence are two spaces... which should signify the end of the
previous sentence.  But functions like "forward-sentence" and
"fill-paragraph" and "backward-sentence" don't recognize it.

Said another way, the "</tag>" string obscures the relationship
between the period before it and the two spaces after it and so fails
to see that one sentence ends and another starts.  This occurs in
text-mode and seems to be inherited by other modes.

If I'm reading "modify-syntax-entry" correctly, the default meanings
of '<' and'>' are, respectively, beginning and end of comment, so
modifying them wouldn't fix this problem.  Or can this be remedied by
a change in the syntax table?  Or is this a bug?


For this particular case, I think you can modify the value of the
`sentence-end' variable (which is returned by the `sentence-end'
function? The whole thing is a little confusing). You'd probably be best
off starting with the docstring for the sentence-end function, and
working back from there.

I think the `sentence-end' variable is automatically buffer-local, which
means if you change it in a mode-hook it ought to work the way you want.
I agree that the whole syntax thing feels like a very well-polished
hack.

E


Eric,

Yes, that would be the variable to adjust. I took a hard look at it anddiscussed it (I believe) on this list years ago, but never came up witha fix. As I see it, there are two problems:

First, "one" of the items in that RE would need to be "zero or moreconsecutive instances of '<' followed by any number of other charactersup until the next '>' is found." E.g., the RE would need to be able tofind the end of this sentence</b></i>.)</q></p></span></div> ThoughI've used REs successfully in quite a few instances and so with a smallbit of help could probably figure that part out, there's a second issue.

My considered opinion is that in the above and similar examples, the endof the sentence is immediately after the period ('.')... or questionmark, exclamation mark, etc. and not after the </div>. That is wherethe point should go when forward-sentence is executed. This means thatno RE would work because, once it finds the RE-defined sentence-end, itthen needs to go backwards within the found string until it encounters[.!?]+ and then search forward again to the first character after. IOW,unless I'm missing some capability of REs, "sentence-end" needs to be afunction rather than an RE and would be a different function than onewhich finds the beginning of a sentence.

[Prev in Thread]

Current Thread

[Next in Thread]

Re: Understanding Word and Sentence Boundaries, ken, 2012/12/11
- Re: Understanding Word and Sentence Boundaries, Eric Abrahamsen, 2012/12/11
  - Re: Understanding Word and Sentence Boundaries, ken <=
    - Re: Understanding Word and Sentence Boundaries, Eric Abrahamsen, 2012/12/12
    - Finding end of sentence[ was Re: Understanding ... Sentence Boundaries], ken, 2012/12/12
    - Re: Finding end of sentence[ was Re: Understanding ... Sentence Boundaries], Eric Abrahamsen, 2012/12/12
    - Re: Finding end of sentence[ was Re: Understanding ... Sentence Boundaries], Eric Abrahamsen, 2012/12/13

Prev by Date: RE: Newlines in file names
Next by Date: Re: Newlines in file names
Previous by thread: Re: Understanding Word and Sentence Boundaries
Next by thread: Re: Understanding Word and Sentence Boundaries
Index(es):
- Date
- Thread